GROUP B STREPTOCOCCUS ANTIGENS

FIELD OF THE INVENTION 

The present invention relates to antigens, more
particularly protein antigens of the group B streptococcus
(GBS) bacterial pathogen, which are useful as vaccine
components for therapy and/or prophylaxis.

BACKGROUND OF THE INVENTION 

Streptococci are gram-positive bacteria that are
differentiated by group-specific carbohydrate antigens A
through O found on their cell surface. Streptococcus groups
are further distinguished by type-specific capsular
polysaccharide antigens. Several serotypes have been
identified for the group B streptococcus (GBS): Ia, Ib, II,
III, IV, V, VI, VII and VIII. GBS also contains antigenic
proteins known as "C-proteins" (alpha, beta, gamma and
delta), some of which have been cloned.

Although GBS is a common component of the normal human
vaginal and colonic flora, this pathogen has long been
recognized as a major cause of neonatal sepsis and
meningitis, late-onset meningitis in infants, and postpartum
endometritis, as well as mastitis in dairy herds. Expectant
mothers exposed to GBS are at risk of postpartum infection
and may transfer the infection to their baby as the child
passes through the birth canal. Although the organism is
sensitive to antibiotics, the high attack rate and rapid
onset of sepsis in neonates and meningitis in infants
result in high morbidity and mortality.

To find a vaccine that will protect individuals from GBS
infection, researchers have turned to the type-specific
antigens. Unfortunately, these polysaccharides have proven
to be poorly immunogenic in humans and are restricted to the
particular serotype from which the polysaccharide
originates. Further, capsular polysaccharides elicit a T
cell-independent response, i.e. no IgG production.
Consequently, capsular polysaccharide antigens are
unsuitable as a vaccine component for protection against GBS
infection.

Others have focused on the C-protein beta antigen, which
demonstrated immunogenic properties in mouse and rabbit
models. This protein was found to be unsuitable as a human
vaccine because of its undesirable property of interacting
with high affinity and in a non-immunogenic manner with the
Fc region of human IgA. The C-protein alpha antigen is rare
in type III serotypes of GBS, which is the serotype
responsible for most GBS-mediated conditions, and is
therefore of little use as a vaccine component.

Therefore there remains an unmet need for GBS antigens that
may be used as vaccine components for the prophylaxis and/or
therapy of GBS infection.

SUMMARY OF THE INVENTION 

According to one aspect, the present invention provides an
isolated polynucleotide encoding a polypeptide having at
least 70% identity to a second polypeptide comprising a
sequence selected from the group consisting of:
SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5,
SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10,
SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 15,
SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19,
SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 23, SEQ ID NO: 24,
SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 29,
SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 33, SEQ ID NO: 34,
SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 38, SEQ ID NO: 39,
SEQ ID NO: 40, SEQ ID NO: 41 and SEQ ID NO: 44 or fragments,
analogs or derivatives thereof.

In other aspects, there are provided vectors comprising
polynucleotides of the invention operably linked to an
expression control region, as well as host cells transfected
with said vectors and methods of producing polypeptides
comprising culturing said host cells under conditions
suitable for expression.

In yet another aspect, there are provided novel polypeptides
encoded by polynucleotides of the invention.

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1a is the DNA sequence of clone 1 (SEQ ID NO: 1) with
corresponding amino acid sequences for open reading frames;

Figure 1b is the amino acid sequence SEQ ID NO: 2;

Figure 1c is the amino acid sequence SEQ ID NO: 3;

Figure 1d is the amino acid sequence SEQ ID NO: 4;

Figure 1e is the amino acid sequence SEQ ID NO: 5;

Figure 1f is the amino acid sequence SEQ ID NO: 6;

Figure 2a is the DNA sequence of clone 2 (SEQ ID NO: 7) with
corresponding amino acid sequences for open reading frames;

Figure 2b is the amino acid sequence SEQ ID NO: 8;

Figure 2c is the amino acid sequence SEQ ID NO: 9;

Figure 2d is the amino acid sequence SEQ ID NO: 10;

Figure 2e is the amino acid sequence SEQ ID NO: 11;

Figure 2f is the amino acid sequence SEQ ID NO: 12;

Figure 3a is the DNA sequence of clone 3 (SEQ ID NO: 13)
with corresponding amino acid sequences for open reading
frames;

Figure 4a is the DNA sequence of clone 4 (SEQ ID NO: 22)
with corresponding amino acid sequences for open reading
frames;

Figure 4b is the amino acid sequence SEQ ID NO: 23;

Figure 4c is the amino acid sequence SEQ ID NO: 24;

Figure 4d is the amino acid sequence SEQ ID NO: 25;

Figure 4e is the amino acid sequence SEQ ID NO: 26;

Figure 5a is the DNA sequence of clone 5 (SEQ ID NO: 27)
with corresponding amino acid sequences for open reading
frames;

Figure 7c is the amino acid sequence SEQ ID NO: 39;

Figure 7d is the amino acid sequence SEQ ID NO: 40;

Figure 7e is the amino acid sequence SEQ ID NO: 41;

Figure 8 is the DNA sequence of a part of clone 7 including
a signal sequence (SEQ ID NO: 42);

Figure 9 is the DNA sequence of a part of clone 7 without a
signal sequence (SEQ ID NO: 43);

Figure 9a is the amino acid sequence (SEQ ID NO: 44);

Figure 10 represents the distribution of anti-GBS ELISA
titers in sera from CD-1 mice immunized with recombinant GBS
protein corresponding to SEQ ID NO: 39.
DETAILED DESCRIPTION OF THE INVENTION

The present invention relates to novel antigenic
polypeptides of group B streptococcus (GBS) characterized by
the amino acid sequence selected from the group consisting
of:
SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5,
SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10,
SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 15,
SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19,
SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 23, SEQ ID NO: 24,
SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 29,
SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 33, SEQ ID NO: 34,
SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 38, SEQ ID NO: 39,
SEQ ID NO: 40, SEQ ID NO: 41 and SEQ ID NO: 44 or fragments,
analogs or derivatives thereof.



A preferred embodiment of the invention includes SEQ ID
NO: 39 and SEQ ID NO: 44.

A further preferred embodiment of the invention is SEQ ID
NO: 39.

A further preferred embodiment of the invention is SEQ ID
NO: 44.

As used herein, "fragments", "derivatives" or "analogs" of
the polypeptides of the invention include those polypeptides
in which one or more of the amino acid residues are
substituted with a conserved or non-conserved amino acid
residue (preferably conserved) and which may be natural or
unnatural.

The terms "fragments", "derivatives" or "analogues" of
polypeptides of the present invention also include
polypeptides which are modified by addition, deletion or
substitution of amino acids, provided that the polypeptides
retain the capacity to induce an immune response.

By the term "conserved amino acid" is meant a substitution
of one or more amino acids for another in which the
antigenic determinant (including its secondary structure and
hydropathic nature) of a given antigen is completely or
partially conserved in spite of the substitution.

For example, one or more amino acid residues within the
sequence can be substituted by another amino acid of a
similar polarity, which acts as a functional equivalent,
resulting in a silent alteration. Substitutes for an amino
acid within the sequence may be selected from other members
of the class to which the amino acid belongs. For example,
the nonpolar (hydrophobic) amino acids include alanine,
leucine, isoleucine, valine, proline, phenylalanine,
tryptophan and methionine. The polar neutral amino acids
include glycine, serine, threonine, cysteine, tyrosine,
asparagine and glutamine. The positively charged (basic)
amino acids include arginine, lysine and histidine. The
negatively charged (acidic) amino acids include aspartic
acid and glutamic acid.
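
The four residue classes listed above can be written down as a small lookup, which makes the "same class" test for a conserved substitution explicit. The following is a minimal sketch only: the function names and the one-letter encoding are assumptions made for illustration, and class membership alone does not establish that an antigenic determinant is actually conserved.

```python
# Residue classes exactly as listed in the text, in one-letter codes.
# The dictionary and function names are illustrative assumptions.
AMINO_ACID_CLASSES = {
    "nonpolar": set("ALIVPFWM"),       # Ala, Leu, Ile, Val, Pro, Phe, Trp, Met
    "polar_neutral": set("GSTCYNQ"),   # Gly, Ser, Thr, Cys, Tyr, Asn, Gln
    "basic": set("RKH"),               # Arg, Lys, His
    "acidic": set("DE"),               # Asp, Glu
}


def residue_class(residue):
    """Return the class name for a one-letter amino acid code."""
    code = residue.upper()
    for name, members in AMINO_ACID_CLASSES.items():
        if code in members:
            return name
    raise ValueError(f"unknown residue: {residue!r}")


def is_conserved_substitution(original, substitute):
    """True when both residues belong to the same class."""
    return residue_class(original) == residue_class(substitute)
```

Under this sketch, replacing leucine with isoleucine counts as conserved (both nonpolar), whereas replacing lysine with aspartic acid does not (basic versus acidic).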

Preferably, derivatives and analogs of polypeptides of the
invention will have about 70% identity with those sequences
illustrated in the figures or fragments thereof. That is,
70% of the residues are the same. More preferably,
polypeptides will have greater than 95% homology. In another
preferred embodiment, derivatives and analogs of
polypeptides of the invention will have fewer than about 20
amino acid residue substitutions, modifications or deletions
and more preferably fewer than 10. Preferred substitutions
are those known in the art as conserved, i.e. the
substituted residues share physical or chemical properties
such as hydrophobicity, size, charge or functional groups.
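
The statement that "70% of the residues are the same" can be made concrete for two sequences that have already been aligned. The sketch below is an assumption-laden illustration: the sequences are hypothetical 10-residue strings (not taken from any SEQ ID NO), the alignment is supplied by the caller, and the alignment length is used as the denominator, which is only one of several conventions in use.

```python
def percent_identity(aligned_a, aligned_b):
    """Percentage of aligned positions that carry the same residue.

    Both inputs must already be aligned and of equal length; '-' marks a
    gap and never counts as a match. The denominator is the alignment
    length (a convention chosen for this sketch).
    """
    if not aligned_a or len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned, non-empty and equal in length")
    matches = sum(1 for a, b in zip(aligned_a, aligned_b) if a == b and a != "-")
    return 100.0 * matches / len(aligned_a)


def count_differences(aligned_a, aligned_b):
    """Number of aligned positions that differ (substitutions or gaps)."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be of equal length")
    return sum(1 for a, b in zip(aligned_a, aligned_b) if a != b)


# Hypothetical reference and analog differing at two of ten positions.
reference = "MKTLLAVGAT"
analog = "MKTLIAVGST"
identity = percent_identity(reference, analog)      # 8 of 10 positions match
differences = count_differences(reference, analog)  # 2 substitutions
```

Here the analog shares 80% identity with the reference and carries two substitutions, so it would satisfy both the 70% identity criterion and the "fewer than 10" substitutions preference, but not the 95% criterion.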
Furthermore, in those situations where amino acid regions
are found to be polymorphic, it may be desirable to vary one
or more particular amino acids to more effectively mimic the
different epitopes of the different GBS strains.

Also included are polypeptides which have fused thereto
other compounds which alter the polypeptide's biological or
pharmacological properties, e.g. polyethylene glycol (PEG)
to increase half-life; leader or secretory amino acid
sequences for ease of purification; prepro- and pro-
sequences; and (poly)saccharides.

Moreover, the polypeptides of the present invention can be
modified by terminal -NH2 acylation (e.g. by acetylation or
thioglycolic acid amidation) or by terminal carboxy
amidation (e.g. with ammonia or methylamine) to provide
stability, or increased hydrophobicity for linking or
binding to a support or other molecule.

Also contemplated are hetero and homo polypeptide multimers
of the polypeptide fragments, analogues and derivatives.
These polymeric forms include, for example, one or more
polypeptides that have been cross-linked with cross-linkers
such as avidin/biotin, glutaraldehyde or
dimethylsuberimidate. Such polymeric forms also include
polypeptides containing two or more tandem or inverted
contiguous sequences, produced from multicistronic mRNAs
generated by recombinant DNA technology.

Preferably, a fragment, analog or derivative of a
polypeptide of the invention will comprise at least one
antigenic region, i.e. at least one epitope.

In order to achieve the formation of antigenic polymers
(i.e. synthetic multimers), polypeptides may be utilized
having bishaloacetyl groups, nitroarylhalides, or the like,
the reagents being specific for thio groups. Therefore, the
link between two mercapto groups of the different peptides
may be a single bond or may be composed of a linking group
of at least two, typically at least four, and not more than
16, but usually not more than about 14 carbon atoms.

In a particular embodiment, polypeptide fragments, analogs
and derivatives of the invention do not contain a methionine
(Met) starting residue. Preferably, polypeptides will not
incorporate a leader or secretory sequence (signal
sequence). The signal portion of a polypeptide of the
invention may be determined according to established
molecular biological techniques. In general, the
polypeptide of interest may be isolated from a GBS culture
and subsequently sequenced to determine the initial residue
of the mature protein and therefore the sequence of the
mature polypeptide.

According to another aspect, there are provided vaccine
compositions comprising one or more GBS polypeptides of the
invention in admixture with a pharmaceutically acceptable
carrier, diluent or adjuvant.

Suitable adjuvants include oils, e.g. Freund's complete or
incomplete adjuvant; salts, e.g. AlK(SO4)2, AlNa(SO4)2,
AlNH4(SO4)2, Al(OH)3, AlPO4, silica, kaolin; saponin
derivatives; carbon polynucleotides, e.g. poly IC and poly
AU; and also detoxified cholera toxin (CTB) and E. coli
heat-labile toxin for induction of mucosal immunity.
Preferred adjuvants include QuilA™, Alhydrogel™ and
Adjuphos™. Vaccines of the invention may be administered
parenterally by injection, rapid infusion, nasopharyngeal
absorption or dermoabsorption, or by the buccal or oral
route.
Vaccine compositions of the invention are used for the
treatment or prophylaxis of streptococcus infection and/or
diseases and symptoms mediated by streptococcus infection,
in particular group A streptococcus (pyogenes), group B
streptococcus (GBS or agalactiae), dysgalactiae, uberis,
nocardia as well as Staphylococcus aureus, and more
particularly group B streptococcus (agalactiae). General
information about Streptococcus is available in Manual of
Clinical Microbiology by P.R. Murray et al. (1995, 6th
Edition, ASM Press, Washington, D.C.). In a particular
embodiment, vaccines are administered to those individuals
at risk of GBS infection, such as pregnant women and infants
for sepsis, meningitis and pneumonia, as well as
immunocompromised individuals such as those with diabetes,
liver disease or cancer. Vaccines may also have veterinary
applications such as for the treatment of mastitis in
cattle, which is mediated by the above-mentioned bacteria as
well as E. coli.

The vaccine of the present invention can also be used for
the manufacture of a medicament used for the treatment or
prophylaxis of streptococcus infection and/or diseases and
symptoms mediated by streptococcus infection, in particular
group A streptococcus (pyogenes), group B streptococcus (GBS
or agalactiae), dysgalactiae, uberis, nocardia as well as
Staphylococcus aureus, and more particularly group B
streptococcus (agalactiae).

Vaccine compositions are preferably in unit dosage form of
about 0.001 to 100 µg/kg (antigen/body weight), more
preferably 0.01 to 10 µg/kg and most preferably 0.1 to 1
µg/kg, administered 1 to 3 times with an interval of about
1 to 12 weeks between immunizations, and more preferably 1
to 6 weeks.
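
Because the dosage ranges above are expressed per kilogram of body weight, the antigen quantity for a given subject is a simple product. The sketch below only illustrates that arithmetic; the tier labels, the function name and the 70 kg example weight are assumptions introduced here and do not appear in the specification.

```python
# Dosage ranges from the text, in micrograms of antigen per kg body weight.
# The tier labels are illustrative assumptions.
DOSE_RANGES_UG_PER_KG = {
    "preferred": (0.001, 100.0),
    "more_preferred": (0.01, 10.0),
    "most_preferred": (0.1, 1.0),
}


def dose_range_ug(body_weight_kg, tier="most_preferred"):
    """Return (minimum, maximum) antigen quantity in micrograms per dose."""
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    low, high = DOSE_RANGES_UG_PER_KG[tier]
    return (low * body_weight_kg, high * body_weight_kg)


# Hypothetical 70 kg subject: roughly 7 to 70 micrograms per dose
# in the most preferred range.
low_ug, high_ug = dose_range_ug(70.0)
```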

According to another aspect, there are provided
polynucleotides encoding polypeptides of group B
streptococcus (GBS) characterized by the amino acid sequence
selected from the group consisting of:

SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5,
SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10,
SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 15,
SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19,
SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 23, SEQ ID NO: 24,
SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 29,
SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 33, SEQ ID NO: 34,
SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 38, SEQ ID NO: 39,
SEQ ID NO: 40, SEQ ID NO: 41 and SEQ ID NO: 44 or fragments,
analogs or derivatives thereof.



Preferred polynucleotides are those illustrated in Figures
1a (SEQ ID NO: 1), 2a (SEQ ID NO: 7), 3a (SEQ ID NO: 13), 4a
(SEQ ID NO: 22), 5a (SEQ ID NO: 27), 6a (SEQ ID NO: 32), 7a
(SEQ ID NO: 37), 8 (SEQ ID NO: 42) and 9 (SEQ ID NO: 43)
which correspond to the open reading frames, encoding
polypeptides of the invention.

Preferred polynucleotides are those illustrated in Figures
1a (SEQ ID NO: 1), 2a (SEQ ID NO: 7), 3a (SEQ ID NO: 13), 4a
(SEQ ID NO: 22), 5a (SEQ ID NO: 27), 6a (SEQ ID NO: 32), 7a
(SEQ ID NO: 37), 8 (SEQ ID NO: 42) and 9 (SEQ ID NO: 43)
and fragments, analogues and derivatives thereof.

More preferred polynucleotides of the invention are those
illustrated in Figures 7 (SEQ ID NO: 37), 8 (SEQ ID NO: 42)
and 9 (SEQ ID NO: 43).

Most preferred polynucleotides of the invention are those
illustrated in Figures 8 (SEQ ID NO: 42) and 9 (SEQ ID NO:
43).

It will be appreciated that the polynucleotide sequences
illustrated in the figures may be altered with degenerate
codons yet still encode the polypeptides of the invention.

Due to the degeneracy of nucleotide coding sequences, other
polynucleotide sequences which encode substantially the
same polypeptides of the present invention may be used in
the practice of the present invention. These include but are
not limited to nucleotide sequences which are altered by the
substitution of different codons that encode the same amino
acid residue within the sequence, thus producing a silent
change.
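
A silent change of the kind described above can be illustrated by translating two different nucleotide sequences and checking that they yield the same peptide. This is a minimal sketch: the codon table is deliberately partial (only the standard-code entries needed here), and the two 12-nucleotide sequences are hypothetical, not taken from any figure.

```python
# Partial standard genetic code: only the codons needed for this example.
CODON_TABLE = {
    "ATG": "M",
    "AAA": "K", "AAG": "K",
    "CTT": "L", "CTC": "L", "CTA": "L", "CTG": "L", "TTA": "L", "TTG": "L",
    "GCT": "A", "GCC": "A", "GCA": "A", "GCG": "A",
}


def translate(dna):
    """Translate a coding DNA sequence, codon by codon, into one-letter residues."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# Hypothetical sequences differing at the third position of three codons.
original = "ATGAAACTTGCT"   # ATG AAA CTT GCT
altered = "ATGAAGCTGGCC"    # ATG AAG CTG GCC

same_peptide = translate(original) == translate(altered)
```

Both sequences translate to Met-Lys-Leu-Ala, so the three nucleotide differences are silent at the polypeptide level.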

Accordingly, the present invention further provides
polynucleotides which hybridize to the polynucleotide
sequences herein above described (or the complement
sequences thereof) having 50% and preferably at least 70%
identity between sequences. More preferably, polynucleotides
are hybridizable under stringent conditions, i.e. having at
least 95% identity and most preferably more than 97%
identity.

By capable of hybridizing under stringent conditions is
meant annealing of a nucleic acid molecule to at least a
region of a second nucleic acid sequence (whether as cDNA,
mRNA, or genomic DNA) or to its complementary strand under
standard conditions, e.g. high temperature and/or low salt
content, which tend to disfavor hybridization of
noncomplementary nucleotide sequences. A suitable protocol
is described in Maniatis T. et al., Molecular Cloning: A
Laboratory Manual, Cold Spring Harbor Laboratory, 1982,
which is herein incorporated by reference.

In a further aspect, polynucleotides encoding polypeptides
of the invention, or fragments, analogs or derivatives
thereof, may be used in a DNA immunization method.
That is, they can be incorporated into a vector which is
replicable and expressible upon injection, thereby producing
the antigenic polypeptide in vivo. For example,
polynucleotides may be incorporated into a plasmid vector
under the control of the CMV promoter, which is functional
in eukaryotic cells. Preferably the vector is injected
intramuscularly.

According to another aspect, there is provided a process for
producing polypeptides of the invention by recombinant
techniques by expressing a polynucleotide encoding said
polypeptide in a host cell and recovering the expressed
polypeptide product. Alternatively, the polypeptides can be
produced according to established synthetic chemical
techniques, i.e. solution phase or solid phase synthesis of
oligopeptides which are ligated to produce the full
polypeptide (block ligation).

For recombinant production, host cells are transfected with
vectors which encode the polypeptide, and then cultured in a
nutrient medium modified as appropriate for activating
promoters, selecting transformants or amplifying the genes.

Suitable vectors are those that are viable and replicable in
the chosen host and include chromosomal, non-chromosomal and
synthetic DNA sequences, e.g. bacterial plasmids, phage DNA,
baculovirus, yeast plasmids, and vectors derived from
combinations of plasmids and phage DNA. The polynucleotide
sequence encoding the polypeptide may be incorporated in the
vector at the appropriate site using restriction enzymes
such that it is operably linked to an expression control
region comprising a promoter, ribosome binding site
(consensus region or Shine-Dalgarno sequence), and
optionally an operator (control element). One can select
individual components of the expression control region that
are appropriate for a given host and vector according to
established molecular biology principles (Sambrook et al.,
Molecular Cloning: A Laboratory Manual, 2nd ed., Cold Spring
Harbor, N.Y., 1989, incorporated herein by reference).
Suitable promoters include but are not limited to the LTR or
SV40 promoter, the E. coli lac, tac or trp promoters and the
phage lambda PL promoter. Vectors will preferably
incorporate an origin of replication as well as selection
markers, e.g. an ampicillin resistance gene. Suitable
bacterial vectors include pET, pQE70, pQE60, pQE-9, pbs,
pD10, phagescript, psiX174, pbluescript SK, pbsks, pNH8A,
pNH16a, pNH18A, pNH46A, ptrc99a, pKK223-3, pKK233-3, pDR540
and pRIT5, and eukaryotic vectors include pBlueBacIII,
pWLNEO, pSV2CAT, pOG44, pXT1, pSG, pSVK3, pBPV, pMSG and
pSVL. Host cells may be bacterial, e.g. E. coli, Bacillus
subtilis, Streptomyces; fungal, e.g. Aspergillus niger,
Aspergillus nidulans; yeast, e.g. Saccharomyces; or
eukaryotic, e.g. CHO, COS.

Upon expression of the polypeptide in culture, cells are
typically harvested by centrifugation then disrupted by
physical or chemical means (if the expressed polypeptide is
not secreted into the media) and the resulting crude extract
retained to isolate the polypeptide of interest.
Purification of the polypeptide from culture media or lysate
may be achieved by established techniques depending on the
properties of the polypeptide, e.g. using ammonium sulfate
or ethanol precipitation, acid extraction, anion or cation
exchange chromatography, phosphocellulose chromatography,
hydrophobic interaction chromatography, hydroxylapatite
chromatography and lectin chromatography. Final
purification may be achieved using HPLC.

The polypeptide may be expressed with or without a leader or
secretion sequence. In the former case the leader may be
removed using post-translational processing (see US
4,431,739; 4,425,437; and 4,338,397, incorporated herein by
reference) or be chemically removed subsequent to purifying
the expressed polypeptide.

According to a further aspect, the GBS polypeptides of the
invention may be used in a diagnostic test for streptococcus
infection, in particular GBS infection. Several diagnostic
methods are possible. For example, to detect a streptococcus
organism in a biological sample, the following procedure may
be followed:

a) obtaining a biological sample from a patient;

b) incubating an antibody or fragment thereof reactive
with a GBS polypeptide of the invention with the
biological sample to form a mixture; and

c) detecting specifically bound antibody or bound fragment
in the mixture which indicates the presence of
streptococcus.

Alternatively, a method for the detection of antibody 
specific to a streptococcus antigen in a biological sample 
containing or suspected of containing said antibody may be 
performed as follows : 

a) isolating a biological sample from a patient; 

b) incubating one or more GBS polypeptides of the 
invention or fragments thereof with the biological 
sample to form a mixture; and 

c) detecting specifically bound antigen or bound fragment 
in the mixture which indicates the presence of antibody 
specific to streptococcus. 

One of skill in the art will recognize that this diagnostic 
test may take several forms, including an immunological test 
such as an enzyme-linked immunosorbent assay (ELISA) , a 
radioimmunoassay or a latex agglutination assay, essentially 
to determine whether antibodies specific for the protein are 
present in an organism. 
The DNA sequences encoding polypeptides of the invention may
also be used to design DNA probes for use in detecting the
presence of streptococcus in a biological sample suspected
of containing such bacteria. The detection method of this
invention comprises:

a) isolating the biological sample from a patient;

b) incubating one or more DNA probes having a DNA sequence
encoding a polypeptide of the invention or fragments
thereof with the biological sample to form a mixture;
and

c) detecting specifically bound DNA probe in the mixture
which indicates the presence of streptococcus bacteria.

The DNA probes of this invention may also be used for
detecting circulating streptococcus (i.e. GBS) nucleic acids
in a sample, for example using a polymerase chain reaction,
as a method of diagnosing streptococcus infections. The
probe may be synthesized using conventional techniques and
may be immobilized on a solid phase, or may be labeled with
a detectable label. A preferred DNA probe for this
application is an oligomer having a sequence complementary
to at least about 6 contiguous nucleotides of a
polynucleotide encoding a GBS polypeptide of the invention.
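
The notion of an oligomer "complementary to at least about 6 contiguous nucleotides" can be sketched as a reverse-complement lookup. The example below is an idealized illustration under stated assumptions: it models perfect Watson-Crick complementarity only (real hybridization depends on stringency and tolerates some mismatches), the function names are invented here, and the target and probe sequences are hypothetical.

```python
# Translation table mapping each base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of a DNA sequence (5'->3')."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def probe_is_complementary(probe, target, min_length=6):
    """True if the probe has at least min_length nucleotides and is perfectly
    complementary to a contiguous stretch of the target strand."""
    probe = probe.upper()
    if len(probe) < min_length:
        return False
    return reverse_complement(probe) in target.upper()


# Hypothetical coding-strand fragment and an 8-mer probe designed against it.
target = "ATGAAACTTGCTGGA"
probe = reverse_complement("AAACTTGC")

binds = probe_is_complementary(probe, target)
```

The probe here is GCAAGTTT, whose reverse complement AAACTTGC occurs in the target, so the check succeeds; a 5-mer would be rejected by the length criterion regardless of its sequence.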

Another diagnostic method for the detection of streptococcus
in a patient comprises:

a) labeling an antibody reactive with a polypeptide of the
invention or fragment thereof with a detectable label;

b) administering the labeled antibody or labeled fragment
to the patient; and

c) detecting specifically bound labeled antibody or
labeled fragment in the patient which indicates the
presence of streptococcus.



[...] polypeptides of the invention as immunogens for the
production of specific antibodies for the diagnosis and in
particular the treatment of streptococcus infection.
Suitable antibodies may be determined using appropriate
screening methods, for example by measuring the ability of a
particular antibody to passively protect against
streptococcus infection in a test model. One example of an
animal model is the mouse model described in the examples
herein. The antibody may be a whole antibody or an
antigen-binding fragment thereof and may in general belong
to any immunoglobulin class. The antibody or fragment may be
of animal origin, specifically of mammalian origin and more
specifically of murine, rat or human origin. It may be a
natural antibody or a fragment thereof, or if desired, a
recombinant antibody or antibody fragment. The term
recombinant antibody or antibody fragment means an antibody
or antibody fragment which was produced using molecular
biology techniques. The antibody or antibody fragments may
be polyclonal, or preferably monoclonal. It may be specific
for a number of epitopes associated with the GBS
polypeptides but is preferably specific for one.

EXAMPLE 1 Murine model of lethal Group B Streptococcus (GBS)
infection

The mouse model of GBS infection is described in detail in
Lancefield et al. (J Exp Med 142:165-179, 1975). GBS strain
C388/90 (clinical isolate obtained in 1990 from the
cerebrospinal fluid of a patient suffering from
meningitis, Children's Hospital of Eastern Ontario, Ottawa,
Canada) and NCS246 (National Center for Streptococcus,
Provincial Laboratory of Public Health for Northern Alberta,
Edmonton, Canada) were respectively serotyped as type Ia/c
and type II/R.
To increase their virulence, the GBS strains C388/90
(serotype Ia/c) and NCS246 (serotype II/R) were serially
passaged through mice as described previously (Lancefield et
al., J Exp Med 142:165-179, 1975). Briefly, the increase of
virulence was monitored using intraperitoneal inoculations
of serial dilutions of a subculture in Todd-Hewitt broth
obtained from either the blood or spleen of infected mice.
After the last passage, infected blood samples were used to
inoculate Todd-Hewitt broth. After an incubation of 2 hours
at 37°C with 7% CO2, glycerol at a final concentration of
10% (v/v) was added to the culture. The culture was then
aliquoted and stored at -80°C for use in GBS challenge
experiments. The number of cfu of GBS present in these
frozen samples was determined. The bacterial concentrations
necessary to kill 100% (LD100) of the 18-week-old mice were
determined to be 3.5 × 10^5 and 1.1 × 10^5 cfu respectively
for GBS strains C388/90 and NCS246, which corresponded to a
significant increase in virulence for both strains. Indeed,
the LD100 recorded before the passages for these two strains
was higher than 10^9 cfu.

In a bacterial challenge, a freshly thawed aliquot of a
virulent GBS strain was adjusted to the appropriate
bacterial concentration using Todd-Hewitt broth and 1 ml was
injected intraperitoneally into each female CD-1 mouse. The
mice used for the passive protection experiments were 6 to 8
weeks old, while the ones used for the active protection
experiments were approximately 18 weeks old at the time of
the challenge. All inocula were verified by colony counts.
Animals were observed for any sign of infection four times
daily for the first 48 h after challenge and then daily for
the next 12 days. At the end of that period, blood samples
were obtained from the survivors and frozen at -20°C. The
spleen obtained from each mouse that survived the challenge
was cultured in order to identify any remaining GBS.
EXAMPLE 2 Immunization and protection in mice with
formaldehyde killed whole GBS cells

Formaldehyde killed GBS whole cells were prepared according
to the procedures described in Lancefield et al. (J Exp Med
142:165-179, 1975). Briefly, an overnight culture on sheep
blood agar plates (Quelab Laboratories, Montreal, Canada) of
a GBS strain was washed twice in PBS buffer
(phosphate-buffered saline, pH 7.2), adjusted to
approximately 3 × 10^9 cfu/mL and incubated overnight in PBS
containing 0.3% (v/v) formaldehyde. The killed GBS
suspension was washed with PBS and kept frozen at -80°C.

Female CD-1 mice, 6 to 8 weeks old (Charles River,
St-Constant, Quebec, Canada), were injected subcutaneously
three times at two-week intervals with 0.1 ml of
formaldehyde killed cells of GBS strain C388/90 (~6 × 10^7
GBS), or 0.1 ml of PBS for the control group. On the day
before the immunization, Alhydrogel™ (Superfos Biosector,
Frederikssund, Denmark) at a final concentration of 0.14 mg
or 0.21 mg of Al was added to these preparations and
incubated overnight at 4°C with agitation. Serum samples
were obtained from each mouse before the beginning of the
immunization protocol and two weeks after the last
injection. The sera were frozen at -20°C.

Eight mice in each group, the control group injected with
PBS and the group immunized with formaldehyde killed whole
cells of GBS strain C388/90 (Ia/c), were challenged with
1.5×10⁴ cfu of GBS strain C388/90 (Ia/c) one week after the
third injection. All mice immunized with the formaldehyde
killed GBS whole cells survived the homologous challenge
while, within 5 days after the challenge, only 4 out of the
8 mice injected with PBS survived the infection. In order to
increase the mortality rate in the control groups, the
bacterial suspension had to be adjusted according to the age
of the mice at the time of the bacterial challenge. In
subsequent challenge experiments, when mice were older than
15 weeks, the bacterial inoculum was increased to
concentrations between 3.0×10⁵ and 2.5×10⁶ cfu.

Table 1 Immunization of CD-1 mice with formaldehyde
killed whole cells of GBS and subsequent
homologous challenge [strain C388/90 (Ia/c)]
and heterologous challenge [strain NCS246
(II/R)].



                                 number of living mice 14 days after the
                                 bacterial challenge (% survival)
antigenic preparations           homologous challenge:    heterologous challenge:
used for immunization¹           strain C388/90 (Ia/c)    strain NCS246 (II/R)
---------------------------------------------------------------------------------
1st infection
  formaldehyde killed cells
  of GBS strain C388/90
  (Ia/c)²                        8/8 (100)³               n.d.⁵
  control PBS                    4/8 (50)                 n.d.

2nd infection
  formaldehyde killed cells
  of GBS strain C388/90
  (Ia/c)                         6/6 (100)⁴               0/6 (0)⁶
  control PBS                    2/6 (33)                 0/6 (0)

¹ Alhydrogel™ at a final concentration of 0.14 mg or 0.21 mg of Al was used;
² approximately 6×10⁷ cfu;
³ intraperitoneal challenge with 1 mL Todd-Hewitt culture medium containing
  GBS C388/90 (Ia/c) suspension adjusted to 1.5×10⁴ cfu;
⁴ intraperitoneal challenge with 1 mL Todd-Hewitt culture medium containing
  GBS C388/90 (Ia/c) suspension adjusted to 2.1×10⁶ cfu;
⁵ not done;
⁶ intraperitoneal challenge with 1 mL Todd-Hewitt culture medium containing
  GBS NCS246 (II/R) suspension adjusted to 1.2×10⁵ cfu.

In another experiment, one group of 12 mice corresponding
to a control group was injected with PBS, while a second
group of 12 mice was immunized with formaldehyde killed
whole cells of GBS strain C388/90 (Ia/c). Six mice from
each of these two groups were challenged with 2.1×10⁶ cfu
of the GBS strain C388/90 (Ia/c) (Table 1). As in the first
challenge experiment, all mice immunized with the GBS
strain C388/90 (Ia/c) survived the homologous challenge.
Only two out of the 6 mice injected with PBS survived the
infection.

The remaining 6 mice in both groups were then used one
week later to verify whether this antigenic preparation
could confer cross protection against strain NCS246 (II/R),
which produces a serologically distinct capsule. None of
the mice infected with this second GBS strain survived the
infection. The latter result suggested that most of the
protective immune response induced by formaldehyde killed
strain C388/90 is directed against the capsular
polysaccharide and that it could be restricted to strains
of that particular serotype. These results clearly
indicated that this particular model of infection can be
efficiently used to study the protection conferred by
vaccination.

EXAMPLE 3 Immunization of rabbit with formaldehyde killed 
whole GBS cells and passive protection in mice 

A New Zealand rabbit (2.5 kg, Charles River, St-Constant,
Quebec, Canada) was immunized with formaldehyde killed
cells of GBS strain C388/90 (Ia/c) to obtain hyperimmune
serum. This rabbit was injected subcutaneously three
times at three-week intervals with approximately 1.5×10⁹
cfu of formaldehyde killed whole cells of GBS strain
C388/90 (Ia/c). Freund's complete adjuvant (Gibco BRL
Life Technologies, Grand Island, New York) was used as the
adjuvant for the first immunization, while Freund's
incomplete adjuvant (Gibco BRL) was used for the following
two injections. Serum samples were obtained before the
beginning of the immunization protocol and two weeks after
the last injection. The sera were frozen at -20°C.

The ability of this particular rabbit hyperimmune serum to
passively protect mice against a lethal infection with GBS
was also evaluated. Intraperitoneal injection of mice with
either 15 or 25 µL of hyperimmune rabbit serum 18 hours
before the challenge protected 4 out of 5 mice (80%) against
the infection. By comparison, survival rates of 20% or lower
were recorded for mice in the control groups injected with
PBS or with serum obtained from a rabbit immunized with a
meningococcal outer membrane preparation. This result
clearly indicates that the immunization of another animal
species with killed GBS cells can induce the production of
antibodies that can passively protect mice. This reagent
will also be used to characterize clones.

Table 2 Passive protection of CD-1 mice conferred by
rabbit serum obtained after immunization with
formaldehyde killed group B whole streptococci
(strain C388/90 (Ia/c)) antigenic preparation

                             number of living mice 14
                             days after the bacterial
                             challenge with GBS strain
groups                       C388/90 (Ia/c)²               % survival
----------------------------------------------------------------------
rabbit hyperimmune serum¹
- 25 µl                      4/5                           80
rabbit hyperimmune serum¹
- 15 µl                      4/5                           80
control rabbit serum
- 25 µl                      1/5                           20
control PBS                  1/10                          10

¹ Freund's complete adjuvant was used for first
immunization, and Freund's incomplete adjuvant for the
following two injections;

² intraperitoneal challenge with 1 ml Todd-Hewitt culture
medium containing GBS C388/90 (Ia/c) suspension adjusted
to 2×10⁴ cfu.

EXAMPLE 4 Recombinant production of His.Tag-GBS fusion
protein

The coding region of a GBS gene was amplified by PCR (DNA
Thermal Cycler GeneAmp PCR system 2400, Perkin Elmer, San
Jose, CA) from the genomic DNA of GBS strain C388/90 (Ia/c)
using oligos that contained base extensions for the
addition of the restriction sites BglII (AGATCT) and HindIII
(AAGCTT), respectively. The PCR product was purified from
agarose gel using a Qiaex II gel extraction kit from Qiagen
(Chatsworth, CA), digested with the restriction enzymes
BglII and HindIII (Pharmacia Canada Inc., Baie d'Urfe,
Canada), and extracted with phenol:chloroform before ethanol
precipitation. The pET-32b(+) vector (Novagen, Madison, WI)
containing the thioredoxin-His.Tag sequence was digested
with the restriction enzymes BglII and HindIII, extracted
with phenol:chloroform, and then ethanol precipitated. The
BglII-HindIII genomic DNA fragment was ligated to the
BglII-HindIII pET-32b(+) vector to create the coding
sequence for the thioredoxin-His.Tag-GBS fusion protein,
whose gene was under control of the T7 promoter. The ligated
products were transformed into E. coli strain XL1 Blue MRF'
(Δ(mcrA)183 Δ(mcrCB-hsdSMR-mrr)173 endA1 supE44 thi-1 recA1
gyrA96 relA1 lac [F' proAB lacIqZΔM15 Tn10 (Tetr)]c)
(Stratagene, La Jolla, CA) according to the method of
Simanis (Hanahan, D. DNA Cloning, 1985, D.M. Glover (ed.),
pp. 109-135). The recombinant pET plasmid was purified using
a Qiagen kit (Qiagen, Chatsworth, CA) and the nucleotide
sequence of the DNA insert was verified by DNA sequencing
(Taq Dye Deoxy Terminator Cycle Sequencing kit, ABI, Foster
City, CA). The recombinant pET plasmid was transformed by
electroporation (Gene Pulser II apparatus, BIO-RAD Labs,
Mississauga, Canada) into E. coli strain AD494(DE3)
(Δara-leu7697 ΔlacX74 ΔphoA PvuII phoR ΔmalF3 F' [lac+
(lacIq) pro] trxB::Kan (DE3)) (Novagen, Madison, WI). In
this strain of E. coli, the T7 promoter controlling
expression of the fusion protein is specifically recognized
by the T7 RNA polymerase (present on the λDE3 prophage),
whose gene is under the control of the lac promoter, which
is inducible by isopropyl-β-D-thio-galactopyranoside (IPTG).
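
The primer design described above (gene-specific oligos
carrying base extensions for BglII and HindIII) can be
sketched as follows. The two recognition sequences are those
given in the text; the gene-specific cores and the two-base
5' clamp are hypothetical placeholders, since the actual
oligo sequences are not given in this example.

```python
BGLII = "AGATCT"    # BglII recognition site, as given in the text
HINDIII = "AAGCTT"  # HindIII recognition site, as given in the text

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def is_palindromic(site: str) -> bool:
    """A site is palindromic if it equals its own reverse complement."""
    return site.upper() == reverse_complement(site)


def add_site(gene_specific: str, site: str, clamp: str = "GC") -> str:
    """Prefix a gene-specific primer with a 5' clamp and a restriction site.

    The clamp (a few extra 5' bases) is an assumption of this sketch.
    """
    return clamp + site + gene_specific.upper()


# Hypothetical gene-specific cores, for illustration only.
forward_core = "ATGAAAAAGCGTCTGGTT"
reverse_core = "TTATTTCGCACCGGTAGC"

forward_primer = add_site(forward_core, BGLII)
reverse_primer = add_site(reverse_core, HINDIII)

print(forward_primer)
print(reverse_primer)
```

Because both sites are palindromic, the same recognition
sequence is read on either strand of the amplified product,
which is why the extension can be written directly on the
5' end of each primer.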

The transformant AD494(DE3)/rpET was grown at 37°C with
agitation at 250 rpm in LB broth (peptone 10 g/L, yeast
extract 5 g/L, NaCl 10 g/L) containing 100 µg of ampicillin
(Sigma-Aldrich Canada Ltd., Oakville, Canada) per mL until
the A600 reached a value of 0.6. In order to induce the
production of the thioredoxin-His.Tag-GBS fusion protein,
the cells were incubated for 2 additional hours in the
presence of IPTG at a final concentration of 1 mM. The
bacterial cells were harvested by centrifugation.

The recombinant fusion protein produced by AD494(DE3)/rpET32
upon IPTG induction for 2 h was partially obtained as
insoluble inclusion bodies which were purified from
endogenous E. coli proteins by the isolation of insoluble
aggregates (Gerlach, G.F. et al. 1992, Infect. Immun.
60:892). Induced cells from a 500 mL culture were
resuspended in 20 mL of 25% sucrose-50 mM Tris-HCl buffer
(pH 8.0) and frozen at -70°C. Lysis of cells in thawed
suspension was achieved by the addition of 5 mL of a solution
of lysozyme (10 mg/mL) in 250 mM Tris-HCl buffer (pH 8.0)
followed by an incubation of 10 to 15 min on ice, and the
addition of 150 mL of detergent mix (5 parts of 20 mM
Tris-HCl buffer [pH 7.4]-300 mM NaCl-2% deoxycholic acid-2%
Nonidet P-40 and 4 parts of 100 mM Tris-HCl buffer
[pH 8]-50 mM EDTA-2% Triton X-100) followed by 5 min
incubation on ice. Upon sonication, protein aggregates were
harvested by centrifugation for 30 min at 35,000 × g and a
sample of the soluble cellular fraction was kept. The
aggregated proteins were solubilized in 6 M guanidine
hydrochloride. The presence of the fusion protein in both
the soluble and insoluble fractions was shown by Western
blot analysis using the serum of a mouse injected with
formaldehyde killed cells of GBS strain C388/90 (Ia/c) that
survived a bacterial challenge with the corresponding GBS
strain.

The purification of the fusion protein from the soluble
fraction of IPTG-induced AD494(DE3)/rpET was done by
affinity chromatography based on the ability of the
His.Tag sequence (6 consecutive histidine residues) to bind
to divalent cations (Ni²⁺) immobilized on the His.Bind metal
chelation resin (Novagen, Madison, WI). The purification
methods used were those described in the pET System Manual,
6th Edition (Novagen, Madison, WI). Briefly, the pelleted
cells obtained from a 100 mL culture induced with IPTG were
resuspended in 4 mL of Binding buffer (5 mM imidazole-500 mM
NaCl-20 mM Tris-HCl pH 7.9), sonicated, and spun at
39,000 × g for 20 min to remove debris. The supernatant was
filtered (0.45 µm pore size membrane) and deposited on a
column of His.Bind resin equilibrated in Binding buffer. The
column was then washed with 10 column volumes of Binding
buffer followed by 6 column volumes of Wash buffer (20 mM
imidazole-500 mM NaCl-20 mM Tris-HCl pH 7.9). The
thioredoxin-His.Tag-GBS fusion protein was eluted with Elute
buffer (1 M imidazole-500 mM NaCl-20 mM Tris-HCl pH 7.9).
The removal of the salt and imidazole from the sample was
done by dialysis against 3 × 1 liter PBS at 4°C.

The quantities of fusion protein obtained from either the
soluble or insoluble cytoplasmic fractions of E. coli were
estimated by Coomassie staining of a sodium dodecyl sulfate
(SDS)-polyacrylamide gel with serial dilutions of these
proteins and a bovine serum albumin standard (Pierce
Chemical Co., Rockford, Ill.).

EXAMPLE 5 Recombinant production of GBS protein under
control of lambda PL promoter

The DNA coding region of a GBS protein was inserted
downstream of the promoter λPL into the translation vector
pURV22. This plasmid was derived from p629 (George et al.,
1987, Bio/Technology 5:600) from which the coding region for
a portion of the herpes simplex virus type I (HSV-I)
glycoprotein (gD-1) was removed and the ampicillin
resistance gene replaced by a kanamycin cassette obtained
from the plasmid vector pUC4K (Pharmacia Biotech Canada
Inc., Baie D'Urfe, Canada). The vector contained a cassette
of the bacteriophage λ cI857 temperature sensitive repressor
gene from which the functional PR promoter had been deleted.
The inactivation of the cI857 repressor by temperature
increase from the ranges of 30-37°C to 37-42°C resulted in
the induction of the gene under the control of λPL. The
translation of the gene was controlled by the ribosome
binding site cro followed downstream by a BglII restriction
site (AGATCT) and the ATG: ACTAAGGAGGTTAGATCTATG.
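
The layout of the translation leader quoted above can be
checked with a short script. This is only an illustrative
sketch: the leader sequence and the BglII site are taken from
the text, while treating AGGAGG as the core of the ribosome
binding site is an assumption made here for the purpose of
locating it.

```python
LEADER = "ACTAAGGAGGTTAGATCTATG"  # sequence quoted in the text

BGLII_SITE = "AGATCT"      # from the text
RBS_CORE = "AGGAGG"        # assumption: purine-rich core of the binding site
START_CODON = "ATG"


def annotate(seq: str) -> dict:
    """Locate the RBS core, the BglII site and the start codon (0-based)."""
    rbs = seq.find(RBS_CORE)
    site = seq.find(BGLII_SITE)
    # Search for the ATG at or after the BglII site, as the text describes.
    start = seq.find(START_CODON, site if site >= 0 else 0)
    if min(rbs, site, start) < 0:
        raise ValueError("expected landmark not found in sequence")
    return {
        "rbs_core": rbs,
        "bglii": site,
        "atg": start,
        # bases between the end of the RBS core and the first base of ATG
        "spacer_nt": start - (rbs + len(RBS_CORE)),
    }


features = annotate(LEADER)
print(features)
```

In this leader the BglII site ends immediately before the ATG,
so a coding region cloned at that site sits directly at the
start codon.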

Restriction enzymes and T4 DNA ligase were used according to
suppliers (Pharmacia Biotech Canada Inc., Baie D'Urfe,
Canada; and New England Biolabs Ltd., Mississauga, Canada).
Agarose gel electrophoresis of DNA fragments was performed
as described by Sambrook et al. (Molecular Cloning: A
Laboratory Manual, 1989, Cold Spring Harbor Laboratory
Press, N.Y.). Chromosomal DNA of the GBS bacteria was
prepared according to procedures described in Jayarao et al.
(J. Clin. Microbiol., 1991, 29:2774). DNA amplification
reactions by polymerase chain reaction (PCR) were made using
DNA Thermal Cycler GeneAmp PCR system 2400 (Perkin Elmer,
San Jose, CA). Plasmids used for DNA sequencing were
purified using plasmid kits from Qiagen (Chatsworth, CA).
DNA fragments were purified from agarose gels using Qiaex II
gel extraction kits from Qiagen (Chatsworth, CA). Plasmid
transformations were carried out by the method described by
Hanahan (DNA Cloning, Glover (ed.), pp. 109-135, 1985). The
sequencing of genomic DNA inserts in plasmids was done using
synthetic oligonucleotides which were synthesized by
oligonucleotide synthesizer model 394 (the Perkin-Elmer
Corp., Applied Biosystems Div. (ABI), Foster City, CA). The
sequencing reactions were carried out by PCR using the Taq
Dye Deoxy Terminator Cycle Sequencing kit (ABI, Foster City,
CA) and DNA electrophoresis was performed on automated DNA
sequencer 373A (ABI, Foster City, CA). The assembly of the
DNA sequence was performed using the program Sequencher 3.0
(Gene Codes Corporation, Ann Arbor, MI). Analysis of the DNA
sequences and their predicted polypeptides was performed
with the program Gene Works version 2.45 (Intelligenetics,
Inc., Mountain View, CA).

The coding region of the GBS gene was amplified by PCR from
GBS strain C388/90 (Ia/c) genomic DNA using oligos that
contained base extensions for the addition of restriction
sites BglII (AGATCT) and XbaI (TCTAGA), respectively. The PCR
product was purified from agarose gel using a Qiaex II gel
extraction kit from Qiagen (Chatsworth, CA), digested with
the restriction enzymes BglII and XbaI, and extracted with
phenol:chloroform before ethanol precipitation. The pURV22
vector was digested with the restriction enzymes BglII and
XbaI, extracted with phenol:chloroform, and ethanol
precipitated. The BglII-XbaI genomic DNA fragment was
ligated to the BglII-XbaI pURV22 vector in which the GBS
gene was under the control of the λPL promoter. The ligated
products were transformed into E. coli strain XL1 Blue MRF'
(Δ(mcrA)183 Δ(mcrCB-hsdSMR-mrr)173 endA1 supE44 thi-1 recA1
gyrA96 relA1 lac [F' proAB lacIqZΔM15 Tn10 (Tetr)]c)
(Stratagene, La Jolla, CA) according to the methods described
in Hanahan, supra. Transformants harboring plasmids with the




WO 99/42588 



PCT/CA99/00114 



insert were identified by analysis of lysed cells submitted
to electrophoresis on agarose gel (Sambrook et al., supra).
The recombinant pURV22 plasmid was purified using a Qiagen
kit (Qiagen, Chatsworth, CA) and the nucleotide sequence of
the DNA insert was verified by DNA sequencing.

The transformant XL1 Blue MRF'/rpURV22 was grown at 34°C
with agitation at 250 rpm in LB broth containing 50 µg of
kanamycin per mL until the A600 reached a value of 0.6. In
order to induce the production of the fusion protein, the
cells were incubated for 4 additional hours at 39°C. The
bacterial cells were harvested by centrifugation,
resuspended in sample buffer, boiled for 10 min and kept at
-20°C.

EXAMPLE 6 Subcloning GBS protein gene in CMV plasmid pCMV-GH 

The DNA coding region of a GBS protein was inserted in phase
downstream of the human growth hormone (hGH) gene which was
under the transcriptional control of the cytomegalovirus
(CMV) promoter in the plasmid vector pCMV-GH (Tang et al.,
Nature, 1992, 356:152). The CMV promoter is nonfunctional
in E. coli cells but active upon administration of the
plasmid in eukaryotic cells. The vector also incorporated
the ampicillin resistance gene.

The coding region of the gene was amplified by PCR from
genomic DNA of GBS strain C388/90 (Ia/c) using oligos
that contained base extensions for the addition of the
restriction sites BglII (AGATCT) and HindIII (AAGCTT). The
PCR product was purified from agarose gel using a Qiaex II
gel extraction kit from Qiagen (Chatsworth, CA), digested
with the restriction enzymes BglII and HindIII, and
extracted with phenol:chloroform before ethanol
precipitation. The pCMV-GH vector (Laboratory of Dr. Stephen
A. Johnston, Department of Biochemistry, The University of
Texas, Dallas, Texas) containing the human growth hormone to
create fusion proteins was digested with the restriction
enzymes BamHI and HindIII, extracted with phenol:chloroform,
and ethanol precipitated. The 1.3-kb BglII-HindIII genomic
DNA fragment was ligated to the BamHI-HindIII pCMV-GH
vector to create the hGH-GBS fusion protein under the
control of the CMV promoter. The ligated products were
transformed into E. coli strain DH5α [φ80 lacZΔM15 endA1
recA1 hsdR17 (rK− mK+) supE44 thi-1 λ− gyrA96 relA1
Δ(lacZYA-argF)U169] (Gibco BRL, Gaithersburg, MD) according
to the methods described by Hanahan, supra. Transformants
harboring plasmids with the insert were identified by
analysis of lysed cells submitted to electrophoresis on
agarose gel (Sambrook, J. et al., supra). The recombinant
pCMV plasmid was purified using a Qiagen kit (Qiagen,
Chatsworth, CA) and the nucleotide sequence of the DNA
insert was verified by DNA sequencing.

EXAMPLE 7 Immunological activity of GBS protein to GBS 
challenge 

Four groups of 12 female CD-1 mice (Charles River,
St-Constant, Quebec, Canada) of 6 to 8 weeks were injected
subcutaneously three times at three-week intervals with
0.1 mL of the following antigenic preparations: formaldehyde
killed cells of GBS strain C388/90 (~6×10⁷ cfu), 20 µg of
thioredoxin-His.Tag-GBS fusion protein obtained from the
insoluble fraction (inclusion bodies), 20 µg of the fusion
protein, affinity purified (nickel column), from the soluble
cytoplasmic fraction in E. coli, or 20 µg of affinity
purified (nickel column) thioredoxin-His.Tag control
polypeptide. 20 µg of QuilA™ (Cedarlane Laboratories Ltd,
Hornby, Canada) was added to each antigenic preparation as
the adjuvant. Serum samples were obtained from each mouse
before immunization (PB) and on days 20 (TB1), 41 (TB2) and
54 (TB3) during the immunization protocols. Sera were frozen
at -20°C.

An increase of the ELISA titers was recorded after each
injection of the fusion protein, indicating a good primary
response and a boost of the specific humoral immune response
after each of the second and third administrations. At the
end of the immunization period, the mean reciprocal ELISA
titer was 456,145 for the group immunized with 20 µg of
fusion protein obtained from inclusion bodies compared to
290,133 for the group of mice immunized with the protein
from the soluble fraction in E. coli. The latter result
suggests that the protein obtained from inclusion bodies
could be more immunogenic than the soluble protein. Analysis
of mice sera in ELISA using the affinity purified
thioredoxin-His.Tag to coat plates showed that negligible
antibody titers are made against the thioredoxin-His.Tag
portion of the fusion protein. The reactivity of the sera
from mice injected with the recombinant fusion protein was
also tested by ELISA against formaldehyde killed whole cells
of GBS strain C388/90. The antibodies induced by
immunization with recombinant fusion protein also recognized
their specific epitopes on GBS cells, indicating that the
conformation of the recombinant protein is close enough to
the native streptococcal protein to induce cross-reactive
antibodies.

To verify whether the immune response induced by
immunization could protect against GBS infection, mice were
challenged with 3.5×10⁵ cfu of GBS strain C388/90 (Ia/c) or
1.2×10⁵ cfu of strain NCS246 (II/R), the results of which
are shown in Tables 3 and 4 respectively. Mice immunized
with control thioredoxin-His.Tag peptide were not protected
against challenge with either GBS strain, while those
immunized with formaldehyde killed C388/90 whole cells were
protected only against homologous challenge. The
thioredoxin-His.Tag-GBS fusion protein of the invention
protected mice from challenge with both GBS strains. Blood
and spleen cultures of these mice did not reveal the
presence of any GBS.

Table 3 Survival from GBS strain C388/90 (Ia/c) challenge¹



                                   no. mice
                                   surviving
immunizing agent                   challenge      % survival
-------------------------------------------------------------
thioredoxin-His.Tag²               1/6            17
formaldehyde killed C388/90
cells³                             6/6            100
thioredoxin-His.Tag-GBS fusion
(inclusion body preparation)⁴      6/6            100
thioredoxin-His.Tag-GBS fusion
(cytoplasmic fraction)⁴            6/6            100

¹ intraperitoneal administration with 1 ml Todd-Hewitt
  culture medium adjusted to 3.5×10⁵ cfu;
² 20 µg administered; posterior legs paralyzed in surviving
  mouse; GBS detected in blood and spleen;
³ 6×10⁷ cfu administered;
⁴ 20 µg administered.

Table 4 Survival from GBS strain NCS246 (II/R) challenge¹

                                   no. mice
                                   surviving
immunizing agent                   challenge      % survival
-------------------------------------------------------------
thioredoxin-His.Tag²               0/6            0
formaldehyde killed C388/90
cells³                             2/6            34
thioredoxin-His.Tag-GBS fusion
(inclusion body preparation)²      5/5⁴           100
thioredoxin-His.Tag-GBS fusion
(cytoplasmic fraction)²            6/6            100

¹ intraperitoneal administration with 1 ml Todd-Hewitt
  culture medium containing GBS NCS246 (II/R) suspension
  adjusted to 1.2×10⁵ cfu;
² 20 µg administered;
³ 6×10⁷ cfu administered;
⁴ one mouse died during immunization.

EXAMPLE 8 Immunization with recombinant GBS protein confers 
protection against experimental GBS infection 


This example illustrates the protection of mice against 
fatal GBS infection by immunization with the recombinant 
protein corresponding to the SEQ ID NO: 39. 

Groups of 10 female CD-1 mice (Charles River) were immunized
subcutaneously three times at three-week intervals with 20
µg of recombinant protein purified from E. coli strain BLR
(Novagen) harboring the recombinant pURV22 plasmid vector
containing the GBS gene corresponding to SEQ ID NO: 42, in
the presence of 20 µg of QuilA™ adjuvant (Cedarlane
Laboratories Ltd, Hornby, Canada) or, as control, with
QuilA™ adjuvant alone in PBS. Blood samples were collected
from the orbital sinus on days 1, 22 and 43 prior to each
immunization and fourteen days (day 57) following the third
injection. One week later the mice were challenged with
approximately 10⁴ to 10⁶ CFU of various virulent GBS strains.

Samples of the GBS challenge inoculum were plated on TSA/5%
sheep blood agar plates to determine the CFU and to verify
the challenge dose. Deaths were recorded for a period of 14
days and on day 14 post-challenge, the surviving mice were
sacrificed and blood and spleen were tested for the presence
of GBS organisms. The survival data are shown in Table 5.

Prechallenge sera were analyzed for the presence of
antibodies reactive with GBS by standard immunoassays. ELISA
and immunoblot analyses indicated that immunization with
recombinant GBS protein produced in E. coli elicited
antibodies reactive with both recombinant and native GBS
protein. Antibody responses to GBS are described in Example
9.

Table 5. Ability of recombinant GBS protein corresponding
to SEQ ID NO: 39 to elicit protection against 8 diverse GBS
challenge strains





                  Challenge strain
Immunogen         Designation    Type     No. alive : No. dead¹
----------------------------------------------------------------
rGBS protein      C388/90        Ia/c     8 : 2 (P<0.0001)
none                                      0 : 10

rGBS protein      NCS 246        II/R     10 : 0 (P=0.0012)
none                                      3 : 7

rGBS protein      ATCC12401      Ib       10 : 0 (P=0.001)
none                                      3 : 7

rGBS protein      NCS 535        V        10 : 0 (P=0.01)
none                                      5 : 5

rGBS protein      NCS 9842       VI       10 : 0 (P<0.0001)
none                                      0 : 10

rGBS protein      NCS 915        III      7 : 3 (P=0.0007)²
NCS 915-F³                                1 : 9
none                                      4 : 6

rGBS protein      NCS 954        III/R    7 : 3 (P=0.002)
NCS 954-F                                 4 : 6
none                                      1 : 9

rGBS protein      COH1           III      4 : 6 (P=0.0004)
COH1-F                                    3 : 7
none                                      0 : 10

¹ Groups of 10 mice were used; the number of mice
  surviving the infection and the number of dead mice are
  indicated. The survival curves corresponding to recombinant
  GBS protein-immunized animals were compared to the survival
  curves corresponding to mock-immunized animals using the
  log-rank test for nonparametric analysis.
² Comparison analysis to NCS915-F-immunized animals.
³ Animals were immunized with formaldehyde-killed GBS in
  presence of QuilA™ adjuvant.
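
The P values in Table 5 come from a log-rank comparison of
survival curves, which needs the day of death of each animal;
those data are not reported in the table. As a simpler and
different check that uses only the day-14 end-point counts,
a two-sided Fisher exact test can be sketched as below. This
is a substitute technique, not the analysis used in the
example, and it is not expected to reproduce the tabulated
P values.

```python
from math import comb


def fisher_exact_two_sided(a: int, b: int, c: int, d: int) -> float:
    """Two-sided Fisher exact P value for the 2x2 table [[a, b], [c, d]].

    Rows are groups (immunized, control); columns are (alive, dead).
    Sums the hypergeometric probabilities of every table with the same
    margins that is no more likely than the observed one.
    """
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(x: int) -> float:
        return comb(row1, x) * comb(row2, col1 - x) / denom

    lo, hi = max(0, col1 - row2), min(row1, col1)
    p_obs = prob(a)
    # small relative tolerance so ties with the observed table are included
    return sum(p for p in map(prob, range(lo, hi + 1)) if p <= p_obs * (1 + 1e-9))


# (alive, dead) for rGBS-immunized vs. mock-immunized mice, from Table 5.
TABLE_5_COUNTS = {
    "C388/90": ((8, 2), (0, 10)),
    "NCS 246": ((10, 0), (3, 7)),
    "NCS 9842": ((10, 0), (0, 10)),
}

for strain, ((a, b), (c, d)) in TABLE_5_COUNTS.items():
    print(f"{strain}: Fisher exact P = {fisher_exact_two_sided(a, b, c, d):.4g}")
```

An end-point test of this kind ignores when deaths occurred,
so it is generally less sensitive than the log-rank test for
groups that differ mainly in time to death.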



All hemocultures from surviving mice were negative at day 14
post-challenge. Spleen cultures from surviving mice were
negative except for a few mice from experiment MB-11.

EXAMPLE 9 Vaccination with the recombinant GBS protein
elicits an immune response to GBS

Groups of 10 female CD-1 mice were immunized subcutaneously
with recombinant GBS protein corresponding to SEQ ID NO: 39
as described in Example 8. In order to assess the antibody
response to native GBS protein, sera from blood samples
collected prior to each immunization and fourteen days after
the third immunization were tested for antibody reactive
with GBS cells by ELISA using plates coated with
formaldehyde-killed GBS cells from type III strain NCS 954,
type Ib strain ATCC12401, type V strain NCS 535 or type VI
strain NCS 9842. The specificity of the raised antibodies
for GBS protein was confirmed by Western blot analyses of
GBS cell extracts and purified recombinant antigens. The
results shown in Figure 10 clearly demonstrate that animals
respond strongly to recombinant GBS protein used as
immunogen, with median reciprocal antibody titers varying
between 12000 and 128000 for sera collected after the third
immunization, depending on the coating antigen. All
preimmune sera were negative when tested at a dilution of
1:100. GBS-reactive antibodies were detectable in the sera
of each animal after a single injection of recombinant GBS
protein.

EXAMPLE 10 Antigenic conservation of the GBS protein of the
present invention

Monoclonal antibodies (MAbs) specific to the GBS protein of
the present invention were used to demonstrate that this
surface antigen is produced by all GBS and that it is also
antigenically highly conserved.

A collection of 68 GBS isolates was used to evaluate the
reactivity of the GBS-specific MAbs. These strains were
obtained from the National Center for Streptococcus,
Provincial Laboratory of Public Health for Northern Alberta,
Canada; Centre Hospitalier Universitaire de Quebec, Pavillon
CHUL, Quebec, Canada; American Type Culture Collection, USA;
Laboratoire de Sante Publique du Quebec, Canada; and Dept.
of Infectious Disease, Children's Hospital and Medical
Center, Seattle, USA. All eight MAbs were tested against
the following panel of strains: 6 isolates of serotype Ia or
Ia/c, 3 isolates of serotype Ib, 4 isolates of serotype II,
14 isolates of serotype III, 2 isolates of serotype IV, 2
isolates of serotype V, 2 isolates of serotype VI, 2
isolates of serotype VII, 1 isolate of serotype VIII, 10
isolates that were not serotyped and 3 bovine S. agalactiae
strains. MAb 3A2 was also reacted with additional GBS: 9
isolates of serotype Ia/c and 10 isolates of serotype V.
The strains were grown overnight on blood agar plates at
37°C in an atmosphere of 5% CO2. Cultures were stored at
-70°C in heart infusion broth with 20% (v/v) glycerol.
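
The strain counts quoted in this example (46, 65 and 68) all
follow from the panel listed above. The short tally below
reproduces that arithmetic; the counts are copied from the
text, and the variable names are only illustrative.

```python
# Isolates tested against all eight MAbs, by serotype (from the text).
FULL_PANEL = {
    "Ia or Ia/c": 6,
    "Ib": 3,
    "II": 4,
    "III": 14,
    "IV": 2,
    "V": 2,
    "VI": 2,
    "VII": 2,
    "VIII": 1,
    "not serotyped": 10,
}
BOVINE_STRAINS = 3                    # bovine S. agalactiae strains
EXTRA_FOR_3A2 = {"Ia/c": 9, "V": 10}  # additional GBS tested with MAb 3A2 only

gbs_full_panel = sum(FULL_PANEL.values())         # 46 GBS tested with all MAbs
gbs_extra = sum(EXTRA_FOR_3A2.values())           # 19 additional GBS
gbs_total = gbs_full_panel + gbs_extra            # 65 GBS in all
collection_total = gbs_total + BOVINE_STRAINS     # 68 isolates in the collection

print(gbs_full_panel, gbs_extra, gbs_total, collection_total)
```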

To obtain the GBS protein-specific MAbs, mice were immunized
three times at three-week intervals with 20 µg of purified
recombinant GBS protein (SEQ ID NO: 44) in the presence of
20% QuilA™ adjuvant. Hybridoma cell lines were generated by
fusion of spleen cells recovered from immunized mice with
the nonsecreting SP2/0 myeloma cell line as described
previously (Hamel, J. et al. 1987. J. Med. Microbiol.
23:163-170). Hybrid clone supernatants were tested for
specific antibody production by ELISA using formaldehyde
inactivated GBS and purified recombinant GBS protein (SEQ ID
NO: 39 or 44) as coating antigen, as previously described
(Hamel, J. et al. 1987. J. Med. Microbiol. 23:163-170).
Specific hybrids were cloned by limiting dilutions, expanded,
and frozen in liquid nitrogen. Production of recombinant GBS
protein was presented in Examples 4 & 5. Purified
recombinant GBS protein or formaldehyde inactivated GBS were
resolved by electrophoresis using the discontinuous
buffer system of Laemmli as recommended by the manufacturer
and then transferred onto nitrocellulose membrane for
Western immunoblotting as described previously (Martin et
al. 1992. Infect. Immun. 60:2718-2725).

Western immunoblotting experiments clearly indicated that
all eight MAbs recognized a protein band that corresponded
to the purified recombinant GBS protein (SEQ ID NO: 39).
These MAbs also reacted with a protein band present in every
GBS isolate tested so far. The reactivity of these
GBS-specific MAbs is presented in Table 6. Each MAb reacted
well with all 46 GBS. In addition, these MAbs also
recognized the 3 S. agalactiae strains of bovine origin that
were tested. MAb 3A2 also recognized nineteen additional
GBS: 9 isolates of serotype Ia/c and 10 of serotype V. The
other MAbs were not tested against these additional strains.

These results demonstrated that the GBS protein (SEQ ID
NO: 39) was produced by all 65 GBS isolates and the three S.
agalactiae strains of bovine origin that were tested so far.
More importantly, these results clearly demonstrated that
the epitopes recognized by these eight GBS-specific MAbs
were widely distributed and conserved among GBS. These
results also indicated that these epitopes were not
restricted to serologically related isolates, since
representatives of all known GBS serotypes, including the
major disease-causing groups, were tested.
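
The isolate counts quoted in this example can be cross-checked with a short tally. This is only an illustrative sketch: the dictionary labels are descriptive names chosen here, not terms from Table 6, and the numbers are the ones stated in the text above.

```python
# Isolate counts as stated in the text of this example (labels are illustrative).
gbs_isolates = {
    "tested with all eight MAbs": 46,
    "serotype Ia/c, tested with MAb 3A2 only": 9,
    "serotype V, tested with MAb 3A2 only": 10,
}
bovine_strains = 3  # S. agalactiae strains of bovine origin

total_gbs = sum(gbs_isolates.values())
total_strains = total_gbs + bovine_strains

print(f"GBS isolates recognized: {total_gbs}")
print(f"All strains recognized (GBS + bovine): {total_strains}")
```

The 46 isolates tested with every MAb plus the 19 additional isolates tested with MAb 3A2 account for the 65 GBS isolates referred to in the summary.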

In conclusion, the data presented in this example clearly
demonstrated that the GBS protein of the present invention
is produced by all GBS isolates tested and that it is
antigenically highly conserved.
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WE CLAIM: 

1. An isolated polynucleotide encoding a polypeptide 
having at least 70% identity to a second polypeptide 
having a sequence selected from the group consisting 
of: 

SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5,
SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10,
SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 15,
SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19,
SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 23, SEQ ID NO: 24,
SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 29,
SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 33, SEQ ID NO: 34,
SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 38, SEQ ID NO: 39,
SEQ ID NO: 40, SEQ ID NO: 41 and SEQ ID NO: 44 or
fragments, analogs or derivatives thereof.

2. A polynucleotide according to claim 1, wherein said 
polynucleotide encodes a polypeptide having at least 
95% identity to the second polypeptide. 

3. An isolated polynucleotide encoding a polypeptide
capable of generating antibodies having binding
specificity for a polypeptide having a sequence
selected from the group consisting of:

SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5,
SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10,
SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 15,
SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19,
SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 23, SEQ ID NO: 24,
SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 29,
SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 33, SEQ ID NO: 34,
SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 38, SEQ ID NO: 39,
SEQ ID NO: 40, SEQ ID NO: 41 and SEQ ID NO: 44 or
fragments, analogs or derivatives thereof.
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4. An isolated polynucleotide that is complementary to the 
polynucleotide of claim 1. 

5. An isolated polynucleotide that is complementary to the
polynucleotide of claim 3.

6. The polynucleotide of claim 1, wherein said 
polynucleotide is DNA. 

7. The polynucleotide of claim 3, wherein said 
polynucleotide is DNA. 

8. The polynucleotide of claim 1, wherein said 
polynucleotide is RNA. 

9. The polynucleotide of claim 3, wherein said 
polynucleotide is RNA. 

10. A polynucleotide which hybridizes under stringent
conditions to a second polynucleotide having a sequence
selected from the group consisting of:

SEQ ID NO: 1, SEQ ID NO: 7, SEQ ID NO: 13, SEQ ID
NO: 22, SEQ ID NO: 27, SEQ ID NO: 32, SEQ ID NO:
37, SEQ ID NO: 42 and SEQ ID NO: 43 or fragments,
analogues or derivatives thereof.

11. A polynucleotide which hybridizes under stringent
conditions to a second polynucleotide having a sequence
selected from the group consisting of:

SEQ ID NO: 37, SEQ ID NO: 42 and SEQ ID NO: 43.

12. A polynucleotide according to claim 11 which hybridizes
under stringent conditions to a second polynucleotide
having the sequence SEQ ID NO: 37.
13. A polynucleotide according to claim 11 which hybridizes
under stringent conditions to a second polynucleotide
having the sequence SEQ ID NO: 42.

14. A polynucleotide according to claim 11 which hybridizes
under stringent conditions to a second polynucleotide
having the sequence SEQ ID NO: 43.

15. A polynucleotide according to claim 10 wherein said 
polynucleotide has at least 95% complementarity to the 
second polynucleotide. 

16. A polynucleotide according to claim 11 wherein said
polynucleotide has at least 95% complementarity to the
second polynucleotide.

17. A vector comprising the polynucleotide of claim 1,
wherein said polynucleotide is operably linked to an
expression control region.

18. A vector comprising the polynucleotide of claim 3, 
wherein said polynucleotide is operably linked to an 
expression control region. 

19. A host cell transfected with the vector of claim 17. 

20. A host cell transfected with the vector of claim 18. 

21. A process for producing a polypeptide comprising 
culturing a host cell according to claim 19 under 
conditions suitable for expression of said polypeptide. 

22. A process for producing a polypeptide comprising
culturing a host cell according to claim 20 under
conditions suitable for expression of said polypeptide.
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23. An isolated polypeptide having at least 70% identity to 
a second polypeptide having a sequence selected from 
the group consisting of: 

SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5,
SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10,
SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 15,
SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19,
SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 23, SEQ ID NO: 24,
SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 29,
SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 33, SEQ ID NO: 34,
SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 38, SEQ ID NO: 39,
SEQ ID NO: 40, SEQ ID NO: 41 and SEQ ID NO: 44 or
fragments, analogs or derivatives thereof.

24. The isolated polypeptide of claim 23 having a sequence
according to SEQ ID NO: 39.

25. The isolated polypeptide of claim 23 having a sequence
according to SEQ ID NO: 44.

26. An isolated polypeptide capable of generating 
antibodies having binding specificity for a second 
polypeptide having a sequence selected from the group 
consisting of: 

SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5,
SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10,
SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 15,
SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19,
SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 23, SEQ ID NO: 24,
SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 29,
SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 33, SEQ ID NO: 34,
SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 38, SEQ ID NO: 39,
SEQ ID NO: 40, SEQ ID NO: 41 and SEQ ID NO: 44 or
fragments, analogs or derivatives thereof.
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27. The isolated polypeptide of claim 26 having a sequence 
according to SEQ ID NO: 39.

28. The isolated polypeptide of claim 26 having a sequence
according to SEQ ID NO: 44.

29. An isolated polypeptide having an amino acid sequence 
selected from the group consisting of: 

SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5,
SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10,
SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 15,
SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19,
SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 23, SEQ ID NO: 24,
SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 29,
SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 33, SEQ ID NO: 34,
SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 38, SEQ ID NO: 39,
SEQ ID NO: 40 and SEQ ID NO: 41 or fragments, analogs or
derivatives thereof.

30. The isolated polypeptide of claim 29 having an amino
acid sequence according to SEQ ID NO: 39.

31. An isolated polypeptide having an amino acid sequence
according to SEQ ID NO: 44.

32. An isolated polypeptide according to any one of claims
29 to 31, wherein the N-terminal Met residue is
deleted.

33. An isolated polypeptide according to any one of claims
29 to 30, wherein the secretory amino acid sequence is
deleted.

34. A vaccine composition comprising a polypeptide according 
to any one of claims 23 to 31 and a pharmaceutically 
acceptable carrier, diluent or adjuvant. 
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35. A vaccine composition comprising a polypeptide 
according to claim 32 and a pharmaceutically acceptable 
carrier, diluent or adjuvant. 

36. A vaccine composition comprising a polypeptide
according to claim 33 and a pharmaceutically acceptable
carrier, diluent or adjuvant.

37. A method for therapeutic or prophylactic treatment of
streptococcal bacterial infection in an animal
susceptible to streptococcal infection comprising
administering to said animal a therapeutic or
prophylactic amount of a composition according to claim
34.

38. A method for therapeutic or prophylactic treatment of
streptococcal bacterial infection in an animal
susceptible to streptococcal infection comprising
administering to said animal a therapeutic or
prophylactic amount of a composition according to claim
35.

39. A method for therapeutic or prophylactic treatment of
streptococcal bacterial infection in an animal
susceptible to streptococcal infection comprising
administering to said animal a therapeutic or
prophylactic amount of a composition according to claim
36.

40. A method according to any one of claims 37 to 39,
wherein said animal is a bovine.

41. A method according to any one of claims 37 to 39,
wherein said animal is a human.
42. A method according to any one of claims 37 to 39,
wherein said bacterial infection is selected from the
group consisting of group A streptococcus and group B
streptococcus.

43. A method according to claim 42, wherein said bacterial
infection is group B streptococcus.

44. Use of a vaccine composition according to claim 34 for
the therapeutic or prophylactic treatment of
streptococcal bacterial infection in an animal
susceptible to or infected with streptococcal infection
comprising administering to said animal a therapeutic
or prophylactic amount of the composition.



45. Use of a vaccine composition according to any one of 
claims 35 to 36 for the therapeutic or prophylactic 
treatment of streptococcal bacterial infection in an 
animal susceptible to or infected with streptococcal 
infection comprising administering to said animal a 
therapeutic or prophylactic amount of the composition. 



46. Use of a vaccine composition according to any one of
claims 23 to 31 for the manufacture of a vaccine for
the therapeutic or prophylactic treatment of 
streptococcal bacterial infection in an animal 
susceptible to streptococcal infection comprising 
administering to said animal a therapeutic or 
prophylactic amount of the composition. 

47. Use of a vaccine composition according to claim 32 for 
the manufacture of a vaccine for the therapeutic or 
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prophylactic treatment of streptococcal bacterial 
infection in an animal susceptible to streptococcal 
infection comprising administering to said animal a 
therapeutic or prophylactic amount of the composition. 

48. Use of a vaccine composition according to claim 33 for 
the manufacture of a vaccine for the therapeutic or 
prophylactic treatment of streptococcal bacterial 
infection in an animal susceptible to streptococcal 
infection comprising administering to said animal a 
therapeutic or prophylactic amount of the composition. 
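
The identity and complementarity thresholds recited in the claims above (at least 70% or 95% identity, at least 95% complementarity) can be illustrated with a small calculation. The sketch below is not the method defined by this specification: it assumes the two sequences have already been aligned to equal length (gaps written as "-"), and it takes identity as matching positions divided by alignment length. The function names and the example sequences are hypothetical.

```python
# Illustrative only: assumes pre-aligned, equal-length sequences.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percentage of alignment columns where both sequences carry the same residue."""
    if not aligned_a or len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be non-empty and aligned to the same length")
    matches = sum(1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-")
    return 100.0 * matches / len(aligned_a)


def reverse_complement(dna: str) -> str:
    """Reverse complement of a DNA string written 5' to 3'."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def percent_complementarity(probe: str, target: str) -> float:
    """Identity between a probe and the reverse complement of its target."""
    return percent_identity(probe.upper(), reverse_complement(target))


# Hypothetical ten-residue peptides differing at one position.
identity = percent_identity("MSTLTIIIAT", "MSTLTITIAT")
print(f"identity: {identity:.1f}%")            # meets a 70% threshold, not a 95% one
print(percent_complementarity("AATTCG", "CGAATT"))
```

In practice the alignment step itself (and the treatment of gaps) strongly affects the reported percentage, so any real comparison would need to state the alignment program and parameters used.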
TATCTGGCAA AGAGCCAGCT AATCGTTTTA GTTGGGCTAA AAATAAATTA TTAATCAATG 60
SGK EPA NRFS WAK NKL LING

GATTCATTGC AACTCTAGCA GCAACTATCT TATTTTTTGC AGTTCAATTC ATAGGTCTTA 120
FIA TLA ATIL FFA VQF IGLK

AACCAGATTA CCCTGGAAAA ACCTACTTTA TTATCCTATT GACAGCATGG ACTTTGATGG 180
PDY PGK TYFI ILL TAW TLMA

CATTAGTAAC TGCTTTAGTG GGATGGGATA ATAGGTATGG TTCCTTCTTG TCGTTATTAA 240
LVT ALV GWDN RYG SFL SLLI

TATTATTATT CCAGCTTGGT TCAAGCGCAG GAACTTACCC AATAGAATTG AGTCCTAAGT 300
LLF QLG SSAG TYP IEL SPKF

TCTTTCAAAC AATTCAACCA TTTTTACCGA TGACTTACTC TGTTTCAGGA TTAAGAGAGA 360
FQT IQP FLPM TYS VSG LRET

CCATCTCGTT GACGGGAGAC GTTAACCATC AATGGAGAAT GCTAGTAATC TTTTTAGTAT 420
ISL TGD VNHQ WRM LVI FLVS

CATCGATGAT ACTTGCTCTT CTTATTTATC GTAAACAAGA AGATTAATAG AAAGTATCTA 480
SMI LAL LIYR KQED

GTGATAGACT AACAGTATGA TATGGTATGT CAAAGTATTT AGGAGGAGAA GATATGTCTA 540
MST
|-->

CTTTAACAAT AATTATTGCA ACATTAACTG CTTTGGAACA TTTTTATATT ATGTATTTGG 600
LTI IIA TLTA LEH FYI MYLE

AGACGTTAGC CACCCAGTCA AATATGACTG GGAAGATTTT TAGTATGTCT AAAGAAGAGT 660
TLA TQS NMTG KIF SMS KEEL

TGTCATATTT ACCCGTTATT AAACTTTTTA AGAATCAAGG TGTATACAAC GGCTTGATTG 720
SYL PVI KLFK NQG VYN GLIG

GCCTATTCCT CCTTTATGGG TTATATATTT CACAGAATCA AGAAATTGTA GCTGTTTTTT 780
LFL LYG LYIS QNQ EIV AVFL

TAATCAATGT ATTGCTAGTT GCTATTTATG GTGCTTTGAC AGTTGATAAA AAAATCTTAT 840
INV LLV AIYG ALT VDK KILL

TAAAACAGGG TGGTTTACCT ATATTAGCTC TTTTAACATT CTTATTTTAA TACTACTTAG 900
KQG GLP ILAL LTF LF

CCGTTCGATT TAGTTGAACG GCTTTTAGTA ATCATTTTTT TCTCATAATA CAGGTAGTTT 960

AAGTAATTTG TCTTTAAAAA TAGTATAATA TAACTACGAA TTCAAAGAGA GGTGACTTTG 1020

ATTATGACTG AGAACTGGTT ACATACTAAA GATGGTTCAG ATATTTATTA TCGTGTQGTT 1080
MTE NWL HTK DGSD IYY RVV
|-->

GGTCAAGGTC AACCGATTGT TTTTTTACAT GGCAATAGCT TAAGTAGTCG CTATTTTGAT 1140
GQGQ PIV FLH GNSL SSR YFD

AAGCAAATAG CATATTTTTC TAAGTATTAC CAAGTTATTG TTATGGATAG TAGAGGGCAT 1200
KQIA YFS KYY QVIV MDS RGH

GGCAAAAGTC ATGCAAAGCT AAATACCATT AGTTTCAGGC AAATAGCAGT TGACTTAAAG 1260
SFRQ IAV DLK
GATATCTTAG TTCATTTAGA GATTGATAAA GTTATATTGG TAGGCCATAG CGATGGTGCC 1320
DILV HLE IDK VILV GHS DGA

AATTTAGCTT TAGTTTTTCA AACGATGTTT CCAGGTATGG TTAGAGGGCT TTTGCTTAAT 1380
NLAL VFQ TMF PGMV RGL LLN

TCAGGGAACC TGACTATTCA TGGTCAGCGA TGGTGGGATA TTCTTTTAGT AAGGATTGCC 1440
SGNL TIH GQR WWDI LLV RIA

TATAAATTCC TTCACTATTT AGGGAAACTC TTTCCGTATA TGAGGCAAAA AGCTCAAGTT 1500
YKFL HYL GKL FPYM RQK AQV

ATTTCGCTTA TGTTGGAGGA TTTGAAGATT AGTCCAGCTG ATTTACAGCA TGTGTCAACT 1560
ISLM LED LKI SPAD LQH VST

CCTGTAATGG TTTTGGTTGG AAATAAGGAC ATAATTAAGT TAAATCATTC TAAGAAACTT 1620
PVMV LVG NKD IIKL NHS KKL

GCTTCTTATT TTCCAAGGGG GGAGTTTTAT TCTTTAGTTG GCTTTGGGCA TCACATTATT 1680
ASYF PRG EFY SLVG FGH HII

AAGCAAGATT CCCATGTTTT TAATATTATT GCAAAAAAGT TTATCAACGA TACGTTGAAA 1740
KQDS HVF NII AKKF IND TLK

GGAGAAATTG TTGAAAAAGC TAATTGAAAA AGTCAAATCA CTGACTTCTG TGATTAAAAT 1800
GEIV EKA N

TGTATTTTTT ATATCTGTTT TAGTGCTTAT TATTGTTGAA ATGATTCATT TGAAACGAAC 1860
MIH LKRT
|-->

TATTTCTGTT GAGCAACTAA AGAGTGTTTT TGGGCAATTA TCTCCAATGA ATCTTTTCTT 1920
ISV EQLK SVF GQL SPMN LFL

AATTATCCTT GTGGGGGTTA TCGCTGTCTT ACCGACAACC GGATATGACT TTGTACTGAA 1980
IIL VGVI AVL PTT GYDF VLN

TGGACTTTTA CGTACAGATA AAAGCAAAAG GTATATTTTA CAGACTAGTT GGTGTATCAA 2040
GLLR TDKS KRY ILQ TSWC IN

CACTTTTAAT AACTTGTCAG GATTCGGTGG CTTAATCGAT ATTGGGTTGC GCATGGCTTT 2100
TFNN LSGF GGL IDI GLRM AF

TTATGGTAAA AAAGGTCAAG AGAAGAGTGA CCTAAGAGAA GTGACTCGTT TTTTACCCTA 2160
YGKK GQEK SDL REV TRFL PY

TCTTATTTCT GGTCTGTCAT TTATTAGTGT GATTGCCTTA ATCATGAGCC ATATTTTTCA 2220
LISG LSFI SVI ALI MSHI FH

TGCCAAAGCT AGTGTTGATT ACTATTATTT GGTATTAATT GGTGCTAGTA TGTATTTTCC 2280
AKAS VDYY YLV LIG ASMY FP

TGTTATTTAT TGGATTTCTG GTCATAAAGG AAGCCATTAT TTCGGAGATA TGCCATCTAG 2340
VIYW ISGH KGS HYF GDMP SS

TACTCGTATA AAATTAGGTG TTGTTTCTTT TTTTGAATGG GGATGTGCGG CCGCAGCATT 2400
TRIK LGVV SFF EWG CAAA AF

TATAATTATC GGTTATTTAA TGGGCATTCA TCTACCAGTT TATAAAATTT TACCACTATT 2460
IIIG YLMG IHL PVY KILP LF
TTGTATTGGT TGTGCCGTCG GGATTGTATC CCTTATTCCC GGTGGATTAG GAAGTTTTGA 2520
CIGC AVGI VSL IPG GLGS FE

ATTAGTTCTA TTTACAGGGT TTGCTGCCGA GGGACTACCT AAAGAAACTG TGGTTGCATG 2580
LVLF TGFA AEG LPK ETVV AW

GTTATTACTT TATCGTTTAG CCTACTATAT TATTCCATTC TTTGCAGGTA TCTATTTCTT 2640
LLLY RLAY YII PFF AGIY FF

TATCCATTAT TTAGGTAGTC AAATAAATCA ACGTTATGAA AATGTCCCGA AAGAGTTAGT 2700
IHYL GSQI NQR YEN VPKE LV

ATCAACTGTT CTACAAACCA TGGTGAGCCA TTTGATGCGT ATTTTAGGTG CATTCTTAAT 2760
STVL QTMV SHL MRI LGAF LI
|-->

ATTTTCAACA GCATTTTTTG AAAATATTAC TTATATTATG TGGTTGCAGA AGCTAGGCTT 2820
FSTA FFEN ITY IMW LQKL GL

GGACCCATTA CAAGAACAAA TGTTATGGCA GTTTCCAGGT TTATTGCTGG GGGTTTGTTT 2880
DPLQ EQML WQF PGL LLGV CF

TATTCTCTTA GCTAGAACTA TTGATCAAAA AGTGAAAAAT GCTTTTCCAA TTGCTATTAT 2940
ILLA RTID QKV KNA FPIA II

CTGGATTACT TTGACATTGT TTTATCTTAA TTTAGGTCAT ATTAGTTGGC GACTATCTTT 3000
WITL TLFY LNL GHI SWRL SF

CTGGTTTATT TTACTATTGT TAGGCTTATT AGTCATTAAG CCAACTCTCT ATAAAAAACA 3060
WFIL LLLG LLV IKP TLYK KQ

ATTTATTTAT AGCTGGGAAG AGCGTATTAA GGATGGAATC ATTATCGTTA GTTTAATGGG 3120
FIYS WEER IKD GII IVSL MG

AGTTCTATTT TATATTGCAG GAGTACTATT CCCTATCAGG GCTCATATTA CAGGTGGTAG 3180
VLFY IAGL LFP IRA HITG GS

TATTGAACGC CTGCATTATA TCATAGCATG GGAGCCGATA GCATTGGCTA CGTTGATTCT 3240
IERL HYII AWE PIA LATL IL

TACTCTCGTT TATTTATGTT TGGTTAAGAT TTTACAAGGA AAATCTTGTC AGATTGGTGA 3300
TLVY LCLV KIL QGK SCQI GD

TGTGTTCAAT GTGGATCGTT ATAAAAAACT ACTTCAAGCT TACGGTGGTT CTTCGGATAG 3360
VFNV DRYK KLL QAY GGSS DS

CGGTTTAGCC TTTTTAAATG ATAAAAGGCT CTACTGGTAC CAAAAAAATG GAGAAGATTG 3420
GLAF LNDK RLY WYQ KNGE DC

CGTTGCGTTC CAATTTGTAA TTGTCAATAA TAAATGTCTT ATTATGGGGG AACCAGCCGG 3480
VAFQ FVIV NNK CLI MGEP AG

TGATGACACT TATATTCGTG AAGCTATTGA ATCGTTTATT GATGATGCTG ATAAGCTAGA 3540
DDTY IREA IES FID DADK LD

CTATGACCTT GTTTTTTACA GTATTGGACA GAAGTTGACA CTACTTTTAC ATGAGTATGG 3600
YDLV FYSI GQK LTL LLHE YG

TTTTGACTTT ATGAAAGTTG GTGAGGATGC TTTAGTTAAT TTAGAAACGT TTACTCTTAA 3660
FDFM KVGE DAL VNL ETFT LK
AGGGAATAAG TACAAACCTT TCAGAAATGC CCTAAATAGA GTTGAAAAGG ATGGTTTCTA 3720
GNKY KPFR NAL NRV EKDG FY

TTTCGAAGTT GTACAATCGC CACATAGTCA AGAGCTACTA AATAGTTTGG AAGAGATTTC 3780
FEVV QSPH SQE LLN SLEE IS

TAATACTTGG TTAGAAGGAC GTCCTGAAAA AGGTTTCTCA CTAGGATATT TTAATAAAGA 3840
NTWL EGRP EKG FSL GYFN KD

TTATTTCCAA CAAGCCCCAA TAGCTTTGGT AAAAAATGCT GAACACGAAG TTGTTGCTTT 3900
YFQQ APIA LVK NAE HEVV AF

TGCTAATATT ATGCCAAACT ATGAAAAGAG TATTATCTCT ATTGATTTAA TGCGTCACGA 3960
ANIM PNYE KSI ISI DLMR HD

TAAACAGAAA ATTCCGAATG GCGTTATGGA TTTCCTCTTT TTATCATTAT TCTCTTATTA 4020
KQKI PNGV MDF LFL SLFS YY

TCAAGAGAAG GGATACCACT ATTTTGATTT GGGGATGGCA CCTTTATCAG GAGTTGGTCG 4080
QEKG YHYF DLG MAP LSGV GR

CGTTGAAACA AGTTTTGCTA AAGAGAGAAT GGCGTATCTT GTCTATCATT TCGGTAGTCA 4140
VETS FAKE RMA YLV YHFG SH

TTTCTACTCA TTTAATGGTT TACACAAGTA TAAGAAGAAG TTTACACCAT TGTGGTCGGA 4200
FYSF NGLH KYK KKF TPLW SE

ACGTTATATT TCTTGTTCTC GTTCGTCCTG GTTAATTTGT GCTATTTGTG CCCTATTAAT 4260
RYIS CSRS SWL ICA ICAL LM

GGAAGATAGT AAAATTAAGA TTGTTAAATA AGCTTTATTT GGCAATTAAA AAGAGCATGT 4320
EDSK IKIV K

CATGCGACAT GCTCTTTTTA AATCATTTAA TACCATTGAT TGCTTGAATC TACTTTATAA 4380

TATGATGTGC TTTTAAATAT TGTTTAGCTA CTGTAGCTGC TGATTTATGC TTTACAGCTA 4440

CTTGGTAGTT CATTTCTTGC ATTTCTTTTT CAGTGATATG ACCAGCAAGT TTATTGAGAG 4500

CTTTTTTTAC TTGA (SEQ ID NO: 1) 4514

FIG. 1a
[clone1-dna/aa]
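
The amino acid lines printed beneath the nucleotide sequence in FIG. 1a follow from the standard genetic code. As a minimal sketch, the code below translates nucleotides 1 to 60 of SEQ ID NO: 1 (as numbered in FIG. 1a) starting at the third nucleotide, which is the reading frame implied by the first residues shown in the figure. The function name `translate` and the `offset` parameter are illustrative choices, not part of the specification.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code with codons enumerated in T, C, A, G order; "*" marks a stop.
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str, offset: int = 0) -> str:
    """Translate complete codons of `dna`, starting `offset` nucleotides in."""
    dna = dna.replace(" ", "").upper()
    codons = (dna[i:i + 3] for i in range(offset, len(dna) - 2, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)


# Nucleotides 1-60 of SEQ ID NO: 1, written in blocks of ten as in FIG. 1a.
first_row = ("TATCTGGCAA AGAGCCAGCT AATCGTTTTA "
             "GTTGGGCTAA AAATAAATTA TTAATCAATG")

peptide = translate(first_row, offset=2)
print(peptide)
```

The 19 residues produced agree with the start of SEQ ID NO: 2 in FIG. 1b; the sixtieth nucleotide belongs to a codon completed on the next row of the figure and is therefore not translated here.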
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SGKEPANRFS WAKNKLLING FIATLAATIL FFAVQFIGLK PDYPGKTYFI 50 

ILLTAWTLMA LVTALVGWDN RYGSFLSLLI LLFQLGSSAG TYPIELSPKF 100 

FQTIQPFLPM TYSVSGLRET ISLTGDVNHQ WRMLVIFLVS SMILALLIYR 150 

KQED (SEQ ID NO: 2) 154

FIG. 1b
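
Each row of the protein figures ends with a running residue count, which gives a simple way to detect dropped or duplicated residues in a transcribed sequence. The sketch below checks the rows of SEQ ID NO: 2 as shown in FIG. 1b; the helper name `check_running_counts` is hypothetical, and the sequence identifier label has been left out of the last row so that each row is only sequence blocks followed by a count.

```python
def check_running_counts(rows):
    """Return True if the count ending each row equals the residues seen so far."""
    total = 0
    for row in rows:
        *blocks, count = row.split()
        total += sum(len(block) for block in blocks)
        if total != int(count):
            return False
    return True


seq_id_2_rows = [
    "SGKEPANRFS WAKNKLLING FIATLAATIL FFAVQFIGLK PDYPGKTYFI 50",
    "ILLTAWTLMA LVTALVGWDN RYGSFLSLLI LLFQLGSSAG TYPIELSPKF 100",
    "FQTIQPFLPM TYSVSGLRET ISLTGDVNHQ WRMLVIFLVS SMILALLIYR 150",
    "KQED 154",
]

rows_consistent = check_running_counts(seq_id_2_rows)
print("running counts consistent:", rows_consistent)
```

The same check can be applied to the other protein figures once their rows are written in this block-and-count layout.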



MSTLTIIIAT LTALEHFYIM YLETLATQSN MTGKIFSMSK EELSYLPVIK 50
LFKNQGVYNG LIGLFLLYGL YISQNQEIVA VFLINVLLVA IYGALTVDKK 100
ILLKQGGLPI LALLTFLF (SEQ ID NO: 3) 118

FIG. 1c

MTENWLHTKD GSDIYYRVVG QGQPIVFLHG NSLSSRYFDK QIAYFSKYYQ 50
VIVMDSRGHG KSHAKLNTIS FRQIAVDLKD ILVHLEIDKV ILVGHSDGAN 100
LALVFQTMFP GMVRGLLLNS GNLTIHGQRW WDILLVRIAY KFLHYLGKLF 150
PYMRQKAQVI SLMLEDLKIS PADLQHVSTP VMVLVGNKDI IKLNHSKKLA 200
SYFPRGEFYS LVGFGHHIIK QDSHVFNIIA KKFINDTLKG EIVEKAN (SEQ ID NO: 4) 247

FIG. 1d
MIHLKRTISV EQLKSVFGQL SPMNLFLIIL VGVIAVLPTT GYDFVLNGLL 50
RTDKSKRYIL QTSWCINTFN NLSGFGGLID IGLRMAFYGK KGQEKSDLRE 100
VTRFLPYLIS GLSFISVIAL IMSHIFHAKA SVDYYYLVLI GASMYFPVIY 150
WISGHKGSHY FGDMPSSTRI KLGVVSFFEW GCAAAAFIII GYLMGIHLPV 200
YKILPLFCIG CAVGIVSLIP GGLGSFELVL FTGFAAEGLP KETVVAWLLL 250
YRLAYYIIPF FAGIYFFIHY LGSQINQRYE NVPKELVSTV LQTMVSHLMR 300
ILGAFLIFST AFFENITYIM WLQKLGLDPL QEQMLWQFPG LLLGVCFILL 350
ARTIDQKVKN AFPIAIIWIT LTLFYLNLGH ISWRLSFWFI LLLLGLLVIK 400
PTLYKKQFIY SWEERIKDGI IIVSLMGVLF YIAGLLFPIR AHITGGSIER 450
LHYIIAWEPI ALATLILTLV YLCLVKILQG KSCQIGDVFN VDRYKKLLQA 500
YGGSSDSGLA FLNDKRLYWY QKNGEDCVAF QFVIVNNKCL IMGEPAGDDT 550
YIREAIESFI DDADKLDYDL VFYSIGQKLT LLLHEYGFDF MKVGEDALVN 600
LETFTLKGNK YKPFRNALNR VEKDGFYFEV VQSPHSQELL NSLEEISNTW 650
LEGRPEKGFS LGYFNKDYFQ QAPIALVKNA EHEVVAFANI MPNYEKSIIS 700
IDLMRHDKQK IPNGVMDFLF LSLFSYYQEK GYHYFDLGMA PLSGVGRVET 750
SFAKERMAYL VYHFGSHFYS FNGLHKYKKK FTPLWSERYI SCSRSSWLIC 800
AICALLMEDS KIKIVK (SEQ ID NO: 5) 816

FIG. 1e

MRILGAFLIF STAFFENITY IMWLQKLGLD PLQEQMLWQF PGLLLGVCFI 50
LLARTIDQKV KNAFPIAIIW ITLTLFYLNL GHISWRLSFW FILLLLGLLV 100
IKPTLYKKQF IYSWEERIKD GIIIVSLMGV LFYIAGLLFP IRAHITGGSI 150
ERLHYIIAWE PIALATLILT LVYLCLVKIL QGKSCQIGDV FNVDRYKKLL 200
QAYGGSSDSG LAFLNDKRLY WYQKNGEDCV AFQFVIVNNK CLIMGEPAGD 250
DTYIREAIES FIDDADKLDY DLVFYSIGQK LTLLLHEYGF DFMKVGEDAL 300
VNLETFTLKG NKYKPFRNAL NRVEKDGFYF EVVQSPHSQE LLNSLEEISN 350
TWLEGRPEKG FSLGYFNKDY FQQAPIALVK NAEHEVVAFA NIMPNYEKSI 400
ISIDLMRHDK QKIPNGVMDF LFLSLFSYYQ EKGYHYFDLG MAPLSGVGRV 450
ETSFAKERMA YLVYHFGSHF YSFNGLHKYK KKFTPLWSER YISCSRSSWL 500
ICAICALLME DSKIKIVK (SEQ ID NO: 6) 518

FIG. 1f
AATTTTGATA TCGAAACAAC AACTTTTGAG GCAATGAAAA AGCACGCGTC ATTATTGGAG 60
NFDI ETT TFE AMKK HAS LLE

AAAATATCTG TTGAGCGTTC TTTTATTGAA TTTGATAAAC TTCTATTAGC ACCTTATTGG 120
KISV ERS FIE FDKL LLA PYW

CGTAAAGGAA TGCTGGCACT AATAGATAGT CATGCTTTTA ATTATCTACC ATGCTTAAAA 180
RKGM LAL IDS HAFN YLP CLK

AATAGGGAAT TACAATTAAG CGCCTTTTTG TCCCAGTTAG ATAAAGATTT TTTATTTGAG 240
NREL QLS AFL SQLD KDF LFE

ACATCAGAAC AAGCTTGGGC ATCACTCATC TTGAGTATGG AAGTTGAACA CACAAAGACT 300
TSEQ AWA SLI LSME VEH TKT

TTTTTAAAAA AATGGAAGAC ATCAACTCAC TTTCAAAAAG ATGTTGAGCA TATAGTGGAT 360
FLKK WKT STH FQKD VEH IVD

GTTTATCGTA TTCGTGAACA AATGGGATTG GCTAAAGAAC ATCTTTATCG TTATGGAAAA 420
VYRI REQ MGL AKEH LYR YGK

ACTATAATAA AACAAGCGGA AGGTATTCGC AAAGCAAGAG GCTTGATGGT TGATTTCGAA 480
TIIK QAE GIR KARG LMV DFE

AAAATAGAAC AACTAGATAG TGAGTTAGCA ATCCATGATA GGCATGAGAT AGTTGTCAAT 540
KIEQ LDS ELA IHDR HEI VVN

GGTGGCACCT TAATCAAGAA ATTAGGAATA AAACCTGGTC CACAGATGGG AGATATTATC 600
GGTL IKK LGI KPGP QMG DII

TCTCAAATTG AATTAGCCAT TGTTTTAGGA CAACTGATTA ATGAAGAAGA GGCTATTTTA 660
SQIE LAI VLG QLIN EEE AIL

CATTTTGTTA AGCAGTACTT GATGGATTAG AGAGGATTAT ATGAGCGATT TTTTAGTAGA 720
HFVK QYL MD MSDF LVD
|-->

TGGATTGACT AAGTCGGTTG GTGATAAGAC GGTCTTTAGT AATGTTTCAT TTATCATCCA 780
GLT KSVG DKT VFS NVSF IIH

TAGTTTAGAC CGTATTGGGA TTATTGGTGT CAATGGAACT GGAAAGACAA CACTATTAGA 840
SLD RIGI IGV NGT GKTT LLD

TGTTATTTCG GGTGAATTAG GTTTTGATGG TGATCGTTCC CCTTTTTCAT CAGCTAATGA 900
VIS GELG FDG DRS PFSS AND

TTATAAGATT GCTTATTTAA AACAAGAACC AGACTTTGAT GATTCTCAGA CAATTTTGGA 960
YKI AYLK QEP DFD DSQT ILD

CACCGTACTT TCTTCTGACT TAAGAGAGAT GGCTTTAATT AAAGAATATG AATTATTGCT 1020
TVL SSDL REM ALI KEYE LLL

TAATCACTAC GAAGAAAGTA AGCAATCACG TCTAGAGAAA GTAATGGCAG AAATGGATTC 1080
NHY EESK QSR LEK VMAE MDS

TTTAGATGCT TGGTCTATTG AGAGCGAAGT CAAAACAGTA TTATCCAAAT TAGGTATTAC 1140
LDA WSIE SEV KTV LSKL GIT

TGATTTGCAG TTGTCGGTTG GTGAATTATC AGGAGGATTA CGAAGACGTG TTCAATTAGC 1200
DLQ LSVG ELS GGL RRRV QLA
GCAAGTATTA TTAAATGATG CAGATTTATT GCTCTTAGAC GAACCTACTA ACCACTTAGA 1260
QVL LNDA DLL LLD EPTN HLD

TATTGACACT ATTGCATGGT TAACGAATTT TTTGAAAAAT AGTAAAAAGA CAGTGCTTTT 1320
IDT IAWL TNF LKN SKKT VLF

TATAACTCAT GATCGTTATT TTCTAGACAA TGTTGCAACA CGTATTTTTG AATTAGATAA 1380
ITH DRYF LDN VAT RIFE LDK

GGCACAGATT ACAGAATATC AAGGCAATTA TCAGGATTAT GTCCGACTTC GTGCAGAACA 1440
AQI TEYQ GNY QDY VRLR AEQ

AGACGAGCGT GATGCTGCTA GTTTACATAA AAAGAAACAG CTTTATAAAC AGGAACTAGC 1500
DER DAAS LHK KKQ LYKQ ELA

TTGGATGCGT ACTCAGCCAC AAGCTCGTGC AACGAAACAA CAGGCTCGTA TTAATCGTTT 1560
WMR TQPQ ARA TKQ QARI NRF

TCAAAATCTA AAAAACGATT TACACCAAAC AAGCGATACA AGCGATTTGG AAATGACATT 1620
QNL KNDL HQT SDT SDLE MTF

TGAAACAAGT CGAATTGGGA AAAAGGTTAT TAATTTTGAA AATGTCTCTT TTTCTTACCC 1680
ETS RIGK KVI NFE NVSF SYP

AGATAAATCT ATCTTGAAAG ACTTTAATTT GTTAATTCAA AATAAAGACC GTATTGGCAT 1740
DKS ILKD FNL LIQ NKDR IGI

CGTTGGAGAT AATGGTGTTG GAAAGTCAAC CTTACTTAAT TTAATTGTTC AAGATTTACA 1800
VGD NGVG KST LLN LIVQ DLQ

GCCGGATTCG GGTAATGTCT CTATTGGTGA AACGATACGT GTAGGTTACT TTTCACAACA 1860
PDS GNVS IGE TIR VGYF SQQ

ACTTCATAAT ATGGATGGCT CAAAACGTGT TATTAATTAT TTGCAAGAGG TTGCAGATGA 1920
LHN MDGS KRV INY LQEV ADE

GGTTAAAACT AGTGTCGGTA CAACAAGTGT GACAGAACTA TTGGAACAAT TTCTCTTTCC 1980
VKT SVGT TSV TEL LEQF LFP

ACGTTCGACA CATGGAACAC AAATTGCAAA ATTATCAGGT GGTGAGAAAA AAAGACTTTA 2040
RST HGTQ IAK LSG GEKK RLY

CCTTTTAAAA ATCCTGATTG AAAAGCCTAA TGTGTTACTA CTTGATGAGC CGACAAATGA 2100
LLK ILIE KPN VLL LDEP TND

CTTAGATATT GCTACATTAA CTGTTCTTGA AAATTTTTTA CAAGGCTTTG GTGGTCCTGT 2160
LDI ATLT VLE NFL QGFG GPV

GATTACAGTT AGTCACGATC GTTACTTTTT AGATAAAGTG GCTAATAAAA TTATTGCGTT 2220
ITV SHDR YFL DKV ANKI IAF

TGAAGATAAC GATATCCGTG AATTTTTTGG TAATTATACT GATTATTTAG ATGAAAAAGC 2280
EDN DIRE FFG NYT DYLD EKA

ATTTAATGAG CAAAATAATG AAGTTATCAG TAAAAAAGAG AGTACCAAGA CAAGTCGTGA 2340
FNE QNNE VIS KKE STKT SRE

AAAGCAAAGT CGTAAAAGAA TGTCTTACTT TGAAAAACAA GAATGGGCGA CAATTGAAGA 2400
KQS RKRM SYF EKQ EWAT IED

CGATATTATG ATATTGGAAA ATACTATCAC TCGTATAGAA AATGATATGC AAACATGTGG 2460
DIM ILEN TIT RIE NDMQ TCG

TAGTGATTTT ACAAGGTTAT CTGATTTACA AAAGGAATTA GATGCAAAAA ATGAAGCACT 2520
SDF TRLS DLQ KEL DAKN EAL

TCTAGAAAAG TATGACCGTT ATGAGTACCT TAGTGAGTTA GACACATGAT TATCCGTCCG
LEK YDRY EYL SEL DTMI IRP

ATTATTAAAA ATGATGACCA AGCAGTTGCA CAATTAATTC GACAAAGTTT ACGCGCCTAT
IIKN DDQ AVA QLIR QSL RAY

GATTTAGATA AACCTGATAC AGCATATTCA GACCCTCACT TAGATCATTT GACCTCATAC 2700
DLDK PDT AYS DPHL DHL TSY

TACGAAAAAA TAGAGAAGTC AGGATTCTTT GTCATTGAGG AGAGAGATGA GATTATTGGC 2760
YEKI EKS GFF VIEE RDE IIG

TGTGGCGGCT TTGGTCCGCT GAAAAATCTA ATTGCAGAGA TGCAGAAGGT GTACATTGCA 2820
CGGF GPL KNL IAEM QKV YIA

GAACGTTTCC GTGGTAAGGG GCTTGCTACT GATTTAGTGA AAATGATTGA AGTAGAAGCT 2880
ERFR GKG LAT DLVK MIE VEA

CGAAAAATTG GGTATAGACA ACTTTATTTA GAGACAGCCA GTACTTTGAG TAGGGCAACT 2940
RKIG YRQ LYL ETAS TLS RAT

GCGGTTTATA AGCATATGGG ATATTGTGCC TTATCGCAAC CAATAGCAAA TGATCAAGGT 3000
AVYK HMG YCA LSQP IAN DQG

CATACAGCTA TGGATATTTG GATGATTAAA GATTTATAAG TTGAAAGTGG ATTAGTGAAC 3060
HTAM DIW MIK DL

ATGGATTAAT TATTTTGAGA TAAGAGGAAA GAAAAGGAGA CATATATGGC ATATATTTGG
MA YIW
|-->

TCTTATTTGA AAAGGTACCC CAATTGGTTA TGGCTTGATT TACTAGGAGC TATGCTTTTT
SYLK RYP NWL WLDL LGA MLF

GTGACGGTTA TCCTAGGAAT GCCCACAGCC TTAGCGGGTA TGATTGATAA TGGCGTTACA 3240
VTVI LGM PTA LAGM IDN GVT

AAAGGTGATC GGACTGGAGT TTATCTGTGG ACGTTCATCA TGTTTATATT TGTTGTACTA 3300
KGDR TGV YLW TFIM FIF VVL

GGTATTATTG GGCGTATTAC GATGGCTTAC GCATCTAGTC GCTTAACGAC AACAATGATT 3360
GIIG RIT MAY ASSR LTT TMI

AGAGATATGC GTAATGATAT GTATGCTAAG CTTCAAGAAT ACTCCCATCA TGAATATGAA 3420
RDMR NDM YAK LQEY SHH EYE

CAGATAGGTG TATCTTCACT AGTGACACGT ATGACAAGCG ATACTTTTGT TTTGATGCAA 3480
QIGV SSL VTR MTSD TFV LMQ

TTTGCTGAAA TGTCTTTACG TTTAGGCCTA GTAACTCCTA TGGTAATGAT TTTTAGCGTG 3540
FAEM SLR LGL VTPM VMI FSV

GTTATGATAC TAATTACGAG TCCATCTTTG GCTTGGCTTG TAGCGGTTGC GATGCCTCTT 3600
VMIL ITS PSL AWLV AVA MPL

TTGGTAGGAG TCGTTTTATA TGTAGCTATA AAAACAAAAC CTTTATCTGA AAGACAACAG 3660
LVGV VLY VAI KTKP LSE RQQ
ACTATGCTTG ATAAAATCAA TCAATATGTT CGTGAAAATT TAACAGGGTT ACGCGTTGTT 3720
TMLD KIN QYV RENL TGL RVV

AGAGCCTTTG CAAGAGAGAA TTTTCAATCA CAAAAATTTC AAGTCGCTAA CCAACGTTAC 3780
RAFA REN FQS QKFQ VAN QRY

ACAGATACTT CAACTGGTCT TTTTAAATTA ACAGGGCTAA CAGAACCACT TTTCGTTCAA 3840
TDTS TGL FKL TGLT EPL FVQ

ATTATTATTG CAATGATTGT GGCTATCGTT TGGTTTGCTT TGGATCCCTT ACAAAGAGGT 3900
IIIA MIV AIV WFAL DPL QRG

GCTATTAAAA TAGGGGATTT AGTTGCTTTT ATCGAATATA GCTTCCATGC TCTCTTTTCA 3960
AIKI GDL VAF IEYS FHA LFS

TTTTTGCTAT TTGCCAATCT TTTTACTATG TATCCTCGTA TGGTGGTATC AAGCCATCGT 4020
FLLF ANL FTM YPRM VVS SHR

ATTAGAGAGG TGATGGATAT GCCAATCTCT ATCAATCCTA ATGCCGAAGG TGTTACGGAT 4080
IREV MDM PIS INPN AEG VTD

ACGAAACTTA AAGGGCATTT AGAATTTGAT AATGTAACAT TCGCTTATCC AGGAGAAACA 4140
TKLK GHL EFD NVTF AYP GET

GAGAGTCCCG TTTTGCATGA TATTTCTTTT AAAGCTAAGC CTGGAGAAAC AATTGCTTTT 4200
ESPV LHD ISF KAKP GET IAF

ATTGGTTCAA CAGGTTCAGG AAAATCTTCT CTTGTTAATT TGATTCCACG TTTTTATGAT 4260
IGST GSG KSS LVNL IPR FYD

GTGACACTTG GAAAAATCTT AGTAGATGGA GTTGATGTAA GAGATTATAA CCTTAAATCA 4320
VTLG KIL VDG VDVR DYN LKS

CTTCGCCAAA AGATTGGATT TATCCCCCAA AAAGCTCTTT TATTTACAGG GACAATAGGA 4380
LRQK IGF IPQ KALL FTG TIG

GAGAATTTAA AATATGGAAA AGCTGATGCT ACTATTGATG ATCTTAGACA AGCGGTTGAT 4440
ENLK YGK ADA TIDD LRQ AVD

ATTTCTCAAG CTAAAGAGTT TATTGAGAGT CACCAAGAAG CCTTTGAAAC GCATTTAGCT 4500
ISQA KEF IES HQEA FET HLA

GAAGGTGGGA GCAATCTTTC TGGGGGTCAA AAACAACGGT TATCTATTGC TAGGGCTGTT 4560
EGGS NLS GGQ KQRL SIA RAV

GTTAAAGATC CAGATTTATA TATTTTTGAT GATTCATTTT CTGCTCTCGA TTATAAGACA 4620
VKDP DLY IFD DSFS ALD YKT

GACGCTACTT TAAGAGCGCG TCTAAAAGAA GTAACCGGTG ATTCTACAGT TTTGATAGTT 4680
DATL RAR LKE VTGD STV LIV

GCTCAAAGGG TGGGTACGAT TATGGATGCT GATCAGATTA TTGTCCTTGA TGAAGGCGAA 4740
AQRV GTI MDA DQII VLD EGE

ATTGTCGGTC GTGGTACCCA CGCTCAATTA ATAGAAAATA ATGCTATTTA TCGTGAAATC 4800
IVGR GTH AQL IENN AIY REI

GCTGAGTCAC AACTGAAGAA CCAAAACTTA TCAGAAGGAG AGTGATTGTA TGAGAAAAAA 4860
AESQ LKN QNL SEGE M RKK
|-->
ATCTGTTTTT TTGAGATTAT GGTCTTACCT AACTCGCTAC AAAGCTACTC TTTTCTTAGC 4920
SVF LRLW SYL TRY KATL FLA

GATTTTTTTG AAAGTTTTAT CTAGTTTTAT GAGTGTTCTG GAGCCTTTTA TTTTAGGGTT 4980
IFL KVLS SFM SVL EPFI LGL

AGCGATAACA GAGTTGACTG CTAACCTTGT TGATATGGCT AAGGGAGTTT CTGGGGCAGA 5040
AIT ELTA NLV DMA KGVS GAE

ATTGAACGTT CCTTATATTG CTGGTATTTT GATTATTTAT TTTTTCAGAG GTGTTTTCTA 5100
LNV PYIA GIL IIY FFRG VFY

TGAATTAGGT TCTTATGGCT CAAATT (SEQ ID NO: 7) 5126
ELG SYGS N

FIG. 2a
NFDIETTTFE AMKKHASLLE KISVERSFIE FDKLLLAPYW RKGMLALIDS 50
HAFNYLPCLK NRELQLSAFL SQLDKDFLFE TSEQAWASLI LSMEVEHTKT 100
FLKKWKTSTH FQKDVEHIVD VYRIREQMGL AKEHLYRYGK TIIKQAEGIR 150
KARGLMVDFE KIEQLDSELA IHDRHEIVVN GGTLIKKLGI KPGPQMGDII 200
SQIELAIVLG QLINEEEAIL HFVKQYLMD (SEQ ID NO: 8) 229

FIG. 2b

MSDFLVDGLT KSVGDKTVFS NVSFIIHSLD RIGIIGVNGT GKTTLLDVIS 50
GELGFDGDRS PFSSANDYKI AYLKQEPDFD DSQTILDTVL SSDLREMALI 100
KEYELLLNHY EESKQSRLEK VMAEMDSLDA WSIESEVKTV LSKLGITDLQ 150
LSVGELSGGL RRRVQLAQVL LNDADLLLLD EPTNHLDIDT IAWLTNFLKN 200
SKKTVLFITH DRYFLDNVAT RIFELDKAQI TEYQGNYQDY VRLRAEQDER 250
DAASLHKKKQ LYKQELAWMR TQPQARATKQ QARINRFQNL KNDLHQTSDT 300
SDLEMTFETS RIGKKVINFE NVSFSYPDKS ILKDFNLLIQ NKDRIGIVGD 350
NGVGKSTLLN LIVQDLQPDS GNVSIGETIR VGYFSQQLHN MDGSKRVINY 400
LQEVADEVKT SVGTTSVTEL LEQFLFPRST HGTQIAKLSG GEKKRLYLLK 450
ILIEKPNVLL LDEPTNDLDI ATLTVLENFL QGFGGPVITV SHDRYFLDKV 500
ANKIIAFEDN DIREFFGNYT DYLDEKAFNE QNNEVISKKE STKTSREKQS 550
RKRMSYFEKQ EWATIEDDIM ILENTITRIE NDMQTCGSDF TRLSDLQKEL 600
DAKNEALLEK YDRYEYLSEL DT (SEQ ID NO: 9) 622

FIG. 2c

MIIRPIIKND DQAVAQLIRQ SLRAYDLDKP DTAYSDPHLD HLTSYYEKIE 50

KSGFFVIEER DEIIGCGGFG PLKNLIAEMQ KVYIAERFRG KGLATDLVKM 100

IEVEARKIGY RQLYLETAST LSRATAVYKH MGYCALSQPI ANDQGHTAMD 150

IWMIKDL (SEQ ID NO: 10)

FIG. 2d 
MAYIWSYLKR YPNWLWLDLL GAMLFVTVIL GMPTALAGMI DNGVTKGDRT 50

GVYLWTFIMF IFVVLGIIGR ITMAYASSRL TTTMIRDMRN DMYAKLQEYS 100

HHEYEQIGVS SLVTRMTSDT FVLMQFAEMS LRLGLVTPMV MIFSVVMILI 150

TSPSLAWLVA VAMPLLVGVV LYVAIKTKPL SERQQTMLDK INQYVRENLT 200

GLRVVRAFAR ENFQSQKFQV ANQRYTDTST GLFKLTGLTE PLFVQIIIAM 250

IVAIVWFALD PLQRGAIKIG DLVAFIEYSF HALFSFLLFA NLFTMYPRMV 300

VSSHRIREVM DMPISINPNA EGVTDTKLKG HLEFDNVTFA YPGETESPVL 350

HDISFKAKPG ETIAFIGSTG SGKSSLVNLI PRFYDVTLGK ILVDGVDVRD 400

YNLKSLRQKI GFIPQKALLF TGTIGENLKY GKADATIDDL RQAVDISQAK 450

EFIESHQEAF ETHLAEGGSN LSGGQKQRLS IARAVVKDPD LYIFDDSFSA 500

LDYKTDATLR ARLKEVTGDS TVLIVAQRVG TIMDADQIIV LDEGEIVGRG 550

THAQLIENNA IYREIAESQL KNQNLSEGE (SEQ ID NO: 11) 579






FIG. 2e 






MRKKSVFLRL WSYLTRYKAT LFLAIFLKVL SSFMSVLEPF ILGLAITELT 50

ANLVDMAKGV SGAELNVPYI AGILIIYFFR GVFYELGSYG SN (SEQ ID NO: 12) 92














FIG. 2f 
AATTTGGAAG TGCTCTATCA ACAGTTGAAG TAAAGGAGAT TATTAGTGAA GAAAACATAT 60
FGS ALS TVEV KEI ISE ENIW

GGTTATATCG GCTCAGTTGC TGCCATTTTA CTAGCTACTC ATATTGGAAG TTACCAACTT 120
LYR LSC CHFT SYS YWK LPTW

GGTAAGCATC ATATGGGTCT AGCAACAAAG GACAATCAGA TTGCCTATAT TGATGACAGC 180
             MGL ATK DNQI AYI DDS
             |-->

AAAGGTAAGG CAAAAGCCCC TAAAACAAAC AAAACGATGG ATCAAATCAG TGCTGAAGAA 240
KGKA KAP KTN KTMD QIS AEE

GGCATCTCTG CTGAACAGAT CGTAGTCAAA ATTACTGACC AAGGCTATGT GACCTCACAC 300
GISA EQI VVK ITDQ GYV TSH

GGTGACCATT ATCATTTTTA CAATGGGAAA GTTCCTTATG ATGCGATTAT TAGTGAAGAG 360
GDHY HFY NGK VPYD AII SEE

TTGTTGATGA CGGATCCTAA TTACCGTTTT AAACAATCAG ACGTTATCAA TGAAATCTTA 420
LLMT DPN YRF KQSD VIN EIL
  |-->

GACGGTTACG TTATTAAAGT CAATGGCAAC TATTATGTTT ACCTCAAGCC AGGTAGTAAG 480
DGYV IKV NGN YYVY LKP GSK

CGCAAAAACA TTCGAACCAA ACAACAAATT GCTGAGCAAG TAGCCAAAGG AACTAAAGAA 540
RKNI RTK QQI AEQV AKG TKE

GCTAAAGAAA AAGGTTTAGC TCAAGTGGCC CATCTCAGTA AAGAAGAAGT TGCGGCAGTC 600
AKEK GLA QVA HLSK EEV AAV

AATGAAGCAA AAAGACAAGG ACGCTATACT ACAGACGATG GCTATATTTT TAGTCCGACA 660
NEAK RQG RYT TDDG YIF SPT

GATATCATTG ATGATTTAGG AGATGCTTAT TTAGTACCTC ATGGTAATCA CTATCATTAT 720
DIID DLG DAY LVPH GNH YHY

ATTCCTAAAA AGGATTTGTC TCCAAGTGAG CTAGCTGCTG CACAAGCCTA CTGGAGTCAA 780
IPKK DLS PSE LAAA QAY WSQ

AAACAAGGTC GAGGTGCTAG ACCGTCTGAT TACCGCCCGA CACCAGCCCC AGGTCGTAGG 840
KQGR GAR PSD YRPT PAP GRR

AAAGCCCCAA TTCCTGATGT GACGCCTAAC CCTGGACAAG GTCATCAGCC AGATAACGGT 900
KAPI PDV TPN PGQG HQP DNG

GGCTATCATC CAGCGCCTCC TAGGCCAAAT GATGCGTCAC AAAACAAACA CCAAAGAGAT 960
GYHP APP RPN DASQ NKH QRD

GAGTTTAAAG GAAAAACCTT TAAGGAACTT TTAGATCAAC TACACCGTCT TGATTTGAAA 1020
EFKG KTF KEL LDQL HRL DLK

TACCGTCATG TGGAAGAAGA TGGGTTGATT TTTGAACCGA CTCAAGTGAT CAAATCAAAC 1080
YRHV EED GLI FEPT QVI KSN

GCTTTTGGGT ATGTGGTGCC TCATGGAGAT CATTATCATA TTATCCCAAG AAGTCAGTTA 1140
AFGY VVP HGD HYHI IPR SQL

TCACCTCTTG AAATGGAATT AGCAGATCGA TACTTAGCTG GCCAAACTGA GGACAATGAC 1200
SPLE MEL ADR YLAG QTE DND

TCAGGTTCAG AGCACTCAAA ACCATCAGAT AAAGAAGTGA CACATACCTT TCTTGGTCAT 1260
SGSE HSK PSD KEVT HTF LGH

CGCATCAAAG CTTACGGAAA AGGCTTAGAT GGTAAACCAT ATGATACGAG TGATGCTTAT 1320
RIKA YGK GLD GKPY DTS DAY

GTTTTTAGTA AAGAATCCAT TCATTCAGTG GATAAATCAG GAGTTACAGC TAAACACGGA 1380
VFSK ESI HSV DKSG VTA KHG

GATCATTTCC ACTATATAGG ATTTGGAGAA CTTGAACAAT ATGAGTTGGA TGAGGTCGCT 1440
DHFH YIG FGE LEQY ELD EVA

AACTGGGTGA AAGCAAAAGG TCAAGCTGAT GAGCTTGCTG CTGCTTTGGA TCAGGAACAA 1500
NWVK AKG QAD ELAA ALD QEQ

GGCAAAGAAA AACCACTCTT TGACACTAAA AAAGTGAGTC GCAAAGTAAC AAAAGATGGT 1560
GKEK PLF DTK KVSR KVT KDG

AAAGTGGGCT ATATGATGCC AAAAGATGGT AAGGACTATT TCTATGCTCG TGATCAACTT 1620
KVGY MMP KDG KDYF YAR DQL

GATTTGACTC AGATTGCCTT TGCCGAACAA GAACTAATGC TTAAAGATAA GAAGCATTAC 1680
DLTQ IAF AEQ ELML KDK KHY

CGTTATGACA TTGTTGACAC AGGTATTGAG CCACGACTTG CTGTAGATGT GTCAAGTCTG 1740
RYDI VDT GIE PRLA VDV SSL

CCGATGCATG CTGGTAATGC TACTTACGAT ACTGGAAGTT CGTTTGTTAT CCCACATATT 1800
PMHA GNA TYD TGSS FVI PHI

GATCATATCC ATGTCGTTCC GTATTCATGG TTGACGCGCG ATCAGATTGC AACAGTCAAG 1860
DHIH VVP YSW LTRD QIA TVK

TATGTGATGC AACACCCCGA AGTTCGTCCG GATGTATGGT CTAAGCCAGG GCATGAAGAG 1920
YVMQ HPE VRP DVWS KPG HEE

TCAGGTTCGG TCATTCCAAA TGTTACGCCT CTTGATAAAC GTGCTGGTAT GCCAAACTGG 1980
SGSV IPN VTP LDKR AGM PNW

CAAATTATCC ATTCTGCTGA AGAAGTTCAA AAAGCCCTAG CAGAAGGTCG TTTTGCAACA 2040
QIIH SAE EVQ KALA EGR FAT

CCAGACGGCT ATATTTTCGA TCCACGAGAT GTTTTGGCCA AAGAAACTTT TGTATGGAAA 2100
PDGY IFD PRD VLAK ETF VWK

GATGGCTCCT TTAGCATCCC AAGAGCAGAT GGCAGTTCAT TGAGAACCAT TAATAAATCT 2160
DGSF SIP RAD GSSL RTI NKS

GATCTATCCC AAGCTGAGTG GCAACAAGCT CAAGAGTTAT TGGCAAAGAA AAATACTGGT 2220
DLSQ AEW QQA QELL AKK NTG

GATGCTACTG ATACGGATAA ACCCAAAGAA AAGCAACAGG CAGATAAGAG CAATGAAAAC 2280
DATD TDK PKE KQQA DKS NEN

CAACAGCCAA GTGAAGCCAG TAAAGAAGAA AAAGAATCAG ATGACTTTAT AGACAGTTTA 2340
QQPS EAS KEE KESD DFI DSL

CCAGACTATG GTCTAGATAG AGCAACCCTA GAAGATCATA TCAATCAATT AGCACAAAAA 2400
PDYG LDR ATL EDHI NQL AQK

GCTAATATCG ATCCTAAGTA TCTCATTTTC CAACCAGAAG GTGTCCAATT TTATAATAAA 2460
ANID PKY LIF QPEG VQF YNK
AATGGTGAAT TGGTAACTTA TGATATCAAG ACACTTCAAC AAATAAACCC TTAACCAAAA 2520
NGEL VTY DIK TLQQ INP

GAAGATCTCA TTGTTAAAGC ACTGCTTTGT CAAAGCAAGT TACGGTGATT TTGAAGTCAT 2580

TCTATGTAAC GAGTAGTGAT AAAAGTTGGA TAATAGCGGT TTTCTTTTGC AAAGAAATGG 2640

TATCCATGTT AGAATAGTAA AAAAAGAGGA GGATTCTTGG ACTAATGTCA AATAAGTAGA 2700

CAGAAAACTG TGTTATTTTA TTGCGTTAAA ATAATTTTCT TCTTTCTGAT TAGGGGTTAG 2760
           . KIANF               YNE EKQ NPTL

TCCTAGATTA GCCGTATGTG GGTTGTAATT GTTATAAAAA TTCTCAATGT ATTCAAAGCA 2820
GLN ATH PNYN NYF NEI YEFC

GTCTAATTGA ACCTGTTTGA TATTTTGATA ATGTTTTCGG TTGATTTGTC TATGCTTTAA 2880
DLQ VQK INQY HKR NIQ RHKL

ATACTTGAAA AATGCTTCAG TTACGGCATT ATCATAAGGA TATCCAGGAT TAGAAAAAGA 2940
YKF FAE TVAN DYP YGP NSFS

ATGCATGATA TTGGCACTGC ACCCTAATAG TGAGACGCAA GAAAAACACT TTTAGGCAAT 3000
HM                                                            AI
<--|

CAGTTTTCTG TACTGTACAG GCGACTGGTC GTTTAATCTC TGTTGAATTC TAGTTTCATT 3060
LKR YQV PSQD NLR QQI RTEN

ATAAAATGTA ATGTAATTTT TAACAATATT TGTTATACTA TCTTTGTTGT ATTTTCTCCT 3120
YFT IYN KVIN TIS DKN YKRR

ATTATGGAAA TAAAAGGTTT CAGTCTTTAG GACGGTGTGA AACCATTCAA TACAGGCATT 3180
NHF YFT ETKL VTH FWE ICAN

ATCTGCAGGT GTTCCTTTTC GAGACATTGA GCGGATAATG TCTTTTTCCG TGCAAGCCTG 3240
DAP TGK RSMS RII DKE TCAQ

GTAGTAAGCC ATAGAAGTAT ACACTGAGCC TTGGTCACTG TGTAAGATTG CTCCTTTATT 3300
YYAM
<--|

TAGGCAATTT TAACTGATTA AGGGTGTCTA GTACAAAATC CGTGTCCTGA CAATCTGAGA 3360
KPLK LQN LTD LVFD TDQ CDS

TAGTGTAAGC TATAATTTCT CGGTTATAGA GATTCATAAT TGATGAGAGA TACAATTTAC 3420
ITYA IIE RNY LNMI SSL YLK

AGTTACCGAA ATATAGGTAG GTAATATCTG TTACGAGCTT TTCCTTAGGC TTATCGGCAT 3480
CNGF YLY TID TVLK EKP KDA

GGAAATCCCG ACTCAATTTA TTATCTGTTA AATAATAAGC TTTACCCAAA TTGGGAACTT 3540
HFDR SLK NDT LYYA KGL NPV

TCTTGGTACG TGTCCGACAA AGCCAGCCAT TATTTTTCAT GATACGATAG ACTTTCTTTG 3600
KKTR TRC LWG NNKM IRY VKK

TATTAACAGT CAATCCGTGG ATTTTTTTGA GCAATCGTGT AATGGTACGA TAGCCATAAA 3660
TNVT LGH IKK LLRT ITR YGY

TAAAGTGATT CTCCATACAG AGCTGTTCAA TTAATTCAAT AAGGTCATCT TTTTTTGCGG 3720
IFH NEM
CTTCTCATAC TCCTTTTTCC AACGGTAATA GGTCGACCGC TTGACCTTAA AACAGTCTAG 3780

AATGAAAACT ATCGGGTAGT TGTTTTTATA GTCTTCCACA AGCTTGATAA GACTTACTTT 3840
                                                            . K

ATCGATTTCC TTATCAAGCC TCGATACTTT TTTAAGAGGT CAACCTGTAA TTGTAATTGT 3900
ISK RILG RYK KLL DVQL QLQ

TCCACTTCAG ACAGATGTTC CAAGCCTTTA CCGTAGGTAT ATTGCTTGCC AACACCTTGA 3960
EVE SLHE LGK GYT YQKG VGQ

TGAAAACGAT AAAGCTCCTC GTTTTCGTAC CATTTCATCC AAGTATAGAT TTGACTATTA 4020
HFR YLEE NEY WKM WTYI QSN

TTTTTGATGC CTAAAGTCTC CATAATAACT CTGTTAGACT TGCCTGCTTT CTTCATATCG 4080
NKI GLTE MIV RNS KGAK KMD

ATGCAAGCCA GCTTAGTTTC CCATGAATAT GCTTTTTTAA CCATAATAAA ACATTCCTGT 4140
ICA LKTE WSY AKKVM
                 <--|

TTCTAGTTTA CTAAATTTCA ACAGGAGTGT TTTTCTTTTG TCTCATTTTA GGGATTCAGT 4200

GCCTATTGTT GTCATCAATT ATTTTTCTAA ATTCCCCGGA CTTAAATTGT GACCCTTGGT 4260

CGGAATGAAA GAGAAGTGTT CCTTCAATCT TTCTTTTATT AAGTGAAAAG GCAACACTTT 4320

TCTGTACAAC ATTTATAAAG TGTTTTTCTA GGCAATTAAT CTTTTAGTCA TTGGTGTTTG 4380
                                 . AIL RKT MPTQ

GTAGTTGAGA CTACCATGAA TGCGGTGGTA ATTCCACCAA TGAACATAGT CTTTAGTCTT 4440
YNL SGH IRHY NWW HVY DKTK

AAGAGCTAGT TCTTCCAGCA ATTGAAAGGT TTCTTGATAA ACAAATTCAA TTTTGAAAGC 4500
LAL EEL LQFT EQY VFE IKFA

ACGATACGTA CTTTCAGCTA CGGCATTGTC ATAAGGATAA CCAGCCTGAC TAAGCGAACG 4560
RYT SEA VAND YPY GAQ SLSR

TGTGATTCCA AAGGCTTCCA ATATTTCATC AATTAACTGA TTATCAAACT CTTTGCCACG 4620
TIG FAE LIED ILQ NDF EKGR

ATCTGAATGG AACATCTTGA CTTTGGTCAG GGCGTAAGGG ATGCTTTGTA TGGCTTGCTT 4680
DSH FMK VKTL AYP ISQ IAQK

AACGAGTTCA GCGGTCTTGT GCCAACCAAG AGACAGGCCG ATGATTTCAC GGTTGTATAG 4740
VLE ATK HWGL SLG IIE RNYL

GTCAATGATG AGGCAAACAT AAGCCCAACG ATTGCCTACA CGAACATAGG TTAAGTCAGT 4800
DII LCV YAWR NGV RVY TLDT

GACTAAGGCT TGTAGTGGTC TTTCTTGCTT AAATTGCCTG TCTAAGTGGT TGGGAATAGG 4860
VLA QLP REQK FQR DLH NPIP

GGCTTCATTC TTGCCTCTAG AATGTGGTTT GAAGGTGGCT TTCTGATAAA CAGAAACCAA 4920
AEN KGR SHPK FTA KQY VSVL

ATTGAGTCGC TTCATAATGC GTCGAATCCG ACGACGTGAA AGTGTGATAC CTTCGTTATT 4980
NLR KMI RRIR RRS LTI GENN

CAAGCATATT TTGATTTTTC TGGATCCGTA TCTAGACTCG CTATCGAGAA AAATTCTTTT 5040
LCI KIK RSGY RSE SDL FIRK
AATAGTTTCT TCAAACTCCG TTTCAGATAC TGACTCCACG GCTTGATAGT AATAACTTGA 5100
ITE EFE TESV SEV AQY YYSS

GTGTGGCATA TTCAGCCAGC GACACATCTT TGAAATGCTG TATTTATCCT TATTAGCAGT 5160
HPM NLW RCMK SIS YKD KNAT

GATTATTTCC CTTTTTGTGC CATAATCACC GCTGCTTGCT TTAGGATATC TAATT 5215
IIE RKT GYDG SSA KPY RI

(SEQ ID NO:13) <--|

FIG. 3a 



FGSALSTVEV KEIISEENIW LYRLSCCHFT SYSYWKLPTW 40 
(SEQ ID NO: 14) 

FIG. 3b 



MGLATKDNQI AYIDDSKGKA KAPKTNKTMD QISAEEGISA EQIVVKITDQ 50

GYVTSHGDHY HFYNGKVPYD AIISEELLMT DPNYRFKQSD VINEILDGYV 100

IKVNGNYYVY LKPGSKRKNI RTKQQIAEQV AKGTKEAKEK GLAQVAHLSK 150

EEVAAVNEAK RQGRYTTDDG YIFSPTDIID DLGDAYLVPH GNHYHYIPKK 200

DLSPSELAAA QAYWSQKQGR GARPSDYRPT PAPGRRKAPI PDVTPNPGQG 250

HQPDNGGYHP APPRPNDASQ NKHQRDEFKG KTFKELLDQL HRLDLKYRHV 300

EEDGLIFEPT QVIKSNAFGY VVPHGDHYHI IPRSQLSPLE MELADRYLAG 350

QTEDNDSGSE HSKPSDKEVT HTFLGHRIKA YGKGLDGKPY DTSDAYVFSK 400

ESIHSVDKSG VTAKHGDHFH YIGFGELEQY ELDEVANWVK AKGQADELAA 450

ALDQEQGKEK PLFDTKKVSR KVTKDGKVGY MMPKDGKDYF YARDQLDLTQ 500

IAFAEQELML KDKKHYRYDI VDTGIEPRLA VDVSSLPMHA GNATYDTGSS 550

FVIPHIDHIH VVPYSWLTRD QIATVKYVMQ HPEVRPDVWS KPGHEESGSV 600

IPNVTPLDKR AGMPNWQIIH SAEEVQKALA EGRFATPDGY IFDPRDVLAK 650

ETFVWKDGSF SIPRADGSSL RTINKSDLSQ AEWQQAQELL AKKNTGDATD 700

TDKPKEKQQA DKSNENQQPS EASKEEKESD DFIDSLPDYG LDRATLEDHI 750

NQLAQKANID PKYLIFQPEG VQFYNKNGEL VTYDIKTLQQ INP (SEQ ID NO: 15) 793











FIG. 3c 
MTDPNYRFKQ SDVINEILDG YVIKVNGNYY VYLKPGSKRK NIRTKQQIAE 50

QVAKGTKEAK EKGLAQVAHL SKEEVAAVNE AKRQGRYTTD DGYIFSPTDI 100

IDDLGDAYLV PHGNHYHYIP KKDLSPSELA AAQAYWSQKQ GRGARPSDYR 150

PTPAPGRRKA PIPDVTPNPG QGHQPDNGGY HPAPPRPNDA SQNKHQRDEF 200

KGKTFKELLD QLHRLDLKYR HVEEDGLIFE PTQVIKSNAF GYVVPHGDHY 250

HIIPRSQLSP LEMELADRYL AGQTEDNDSG SEHSKPSDKE VTHTFLGHRI 300

KAYGKGLDGK PYDTSDAYVF SKESIHSVDK SGVTAKHGDH FHYIGFGELE 350

QYELDEVANW VKAKGQADEL AAALDQEQGK EKPLFDTKKV SRKVTKDGKV 400

GYMMPKDGKD YFYARDQLDL TQIAFAEQEL MLKDKKHYRY DIVDTGIEPR 450

LAVDVSSLPM HAGNATYDTG SSFVIPHIDH IHVVPYSWLT RDQIATVKYV 500

MQHPEVRPDV WSKPGHEESG SVIPNVTPLD KRAGMPNWQI IHSAEEVQKA 550

LAEGRFATPD GYIFDPRDVL AKETFVWKDG SFSIPRADGS SLRTINKSDL 600

SQAEWQQAQE LLAKKNTGDA TDTDKPKEKQ QADKSNENQQ PSEASKEEKE 650

SDDFIDSLPD YGLDRATLED HINQLAQKAN IDPKYLIFQP EGVQFYNKNG 700

ELVTYDIKTL QQINP (SEQ ID NO:16) 715






FIG. 3d 






MHSFSNPGYP YDNAVTEAFF KYLKHRQINR KHYQNIKQVQ LDCFEYIENF 50

YNNYNPHTAN LGLTPNQKEE NYFNAIK (SEQ ID NO: 17) 77






FIG. 3e 






MAYYQACTEK DIIRSMSRKG TPADNACIEW FHTVLKTETF YFHNRRKYNK 50

DSITNIVKNY ITFYNETRIQ QRLNDQSPVQ YRKLIA (SEQ ID NO: 18) 86






FIG. 3f 






MENHFIYGYR TITRLLKKIH GLTVNTKKVY RIMKNNGWLC RTRTKKVPNL 50

GKAYYLTDNK LSRDFHADKP KEKLVTDITY LYFGNCKLYL SSIMNLYNRE 100

IIAYTISDCQ DTDFVLDTLN QLKLPK (SEQ ID NO: 19)

FIG. 3g 
MVKKAYSWET KLACIDMKKA GKSNRVIMET LGIKNNSQIY TWMKWYENEE
LYRFHQGVGK QYTYGKGLEH LSEVEQLQLQ VDLLKKYRGL IRKSIK
(SEQ ID NO:20)

FIG. 3h



IRYPKASSGD YGTKREIITA NKDKYSISKM CRWLNMPHSS YYYQAVESVS 50 

ETEFEETIKR IFLDSESRYG SRKIKICLNN EGITLSRRRI RRIMKRLNLV 100 

SVYQKATFKP HSRGKNEAPI PNHLDRQFKQ ERPLQALVTD LTYVRVGNRW 150 

AYVCLIIDLY NREIIGLSLG WHKTAELVKQ AIQSIPYALT KVKMFHSDRG 200 

KEFDNQLIDE ILEAFGITRS LSQAGYPYDN AVAESTYRAF KIEFVYQETF 250 

QLLEELALKT KDYVHWWNYH RIHGSLNYQT PMTKRLIA (SEQ ID NO:21) 288

FIG. 3i 



AATTTGAAAG CAGAATTATC TGTAGAAGAT GAGCAATATA CAGCAACAGT TTATGGTAAA 60
NLKA ELS VED EQYT ATV YGK

TCTGCTCATG GTTCAACACC ACAAGAAGGT GTTAATGGGG CGACTTATTT AGCTCTTTAT 120
SAHG STP QEG VNGA TYL ALY

CTAAGTCAAT TTGATTTTGA AGGTCCTGCT CGTGCTTTCT TAGATGTTAC AGCCAACATT 180
LSQF DFE GPA RAFL DVT ANI

ATTCACGAAG ACTTCTCAGG TGAAAAACTT GGAGTAGCTT ATGAAGATGA CTGTATGGGA 240
IHED FSG EKL GVAY EDD CMG

CCATTGAGCA TGAATGCAGG TGTCTTCCAG TTTGATGAAA CTAATGATGA TAATACTATC 300
PLSM NAG VFQ FDET NDD NTI

GCTCTTAATT TCCGTTACCC ACAAGGGACA GATGCTAAAA CTATCCAAAC TAAGCTTGAG 360
ALNF RYP QGT DAKT IQT KLE

AAACTTAACG GAGTTGAAAA AGTGACTCTT TCTGACCATG AACACACACC ACACTATGTA 420
KLNG VEK VTL SDHE HTP HYV

CCTATGGACG ATGAATTAGT ATCAACCTTA CTAGCTGTCT ATGAAAAGCA AACTGGTCTT 480
PMDD ELV STL LAVY EKQ TGL

AAAGGACATG AACAGGTTAT TGGTGGTGGG ACATTTGGTC GCTTACTTGA ACGGGGTGTT 540
KGHE QVI GGG TFGR LLE RGV

GCATACGGTG CCATGTTCCC AGGAGATGAA AACACTATGC ATCAAGCTAA TGAGTACATG 600
AYGA MFP GDE NTMH QAN EYM

CCTTTAGAAA ATATTTTCCG TTCGGCTGCT ATCTACGCAG AAGCTATCTA TGAATTAATC 660
PLEN IFR SAA IYAE AIY ELI

AAATAAAATA ATCCTTAAAC TAAATATGTG ATCAATGATA AAGGGTGGTG AAGACATGAA 720
K .

AGTGTCTTTG CCTCTTTTCA TAAGGTTAGA TTTGGAGACT TTATGACTGA CTTGGAAAAA 780
                                             MTD LEK
                                             |-->

ATTATTAAAG CAATAAAAAG TGATTCACAG AATCAAAATT ATACAGAAAA TGGTATTGAT 840
IIKA IKS DSQ NQNY TEN GID

CCTTTGTTTG CTGCTCCTAA AACAGCTAGG ATCAATATTG TTGGCCAAGC ACCTGGTTTA 900
PLFA APK TAR INIV GQA PGL

AAAACTCAAG AAGCAAGACT CTATTGGAAA GATAAATCTG GAGATCGTCT ACGCCAGTGG 960
KTQE ARL YWK DKSG DRL RQW

CTTGGAGTTG ATGAAGAGAC ATTTTACCAT TCTGGAAAAT TTGCTGTTTT ACCTTTAGAT 1020
LGVD EET FYH SGKF AVL PLD

TTTTATTACC CAGGCAAAGG AAAATCAGGA GATTTACCCC CTAGAAAAGG TTTTGCGGAG 1080
FYYP GKG KSG DLPP RKG FAE

AAATGGCACC CTCTTATTTT AAAAGAAATG CCTAATGTTC AATTGACCTT GCTAGTTGGT 1140
KWHP LIL KEM PNVQ LTL LVG

CAGTATGCTC AGAAATATTA TCTTGGAAGC TCCGCACATA AAAATCTAAC AGAAACAGTT 1200
QYAQ KYY LGS SAHK NLT ETV

AAAGCTTACA AAGACTATCT ACCCGATTAT TTACCCCTGG TTCACCCATC ACCGCGAAAT 1260
KAYK DYL PDY LPLV HPS PRN

CAAATTTGGC TAAAGAAGAA TCCATGGTTT GAAAAAGATC TAATCGTTGA TTTACAAAAG 1320
QIWL KKN PWF EKDL IVD LQK

ATAGTAGCAG ATATTTTAAA AGATTAAGGA TAGGAGTTGG TATGAGAGAT AATCATCTAC 1380
IVAD ILK D .                                M RDN HLH
                                            |-->

ACACGTATTT TTCCTATGAT TGTCAAACGG CATTTGAGGA CTATATTAAT GGTTTTACAG 1440
TYF SYD CQTA FED YIN GFTG

GTGAATTTAT CACGACAGAA CATTTTGATT TATCAAATCC TTACACCGGT CAAGACGATG 1500
EFI TTE HFDL SNP YTG QDDV

TTCCTGATTA TAGTGCTTAT TGTCAAAAAA TAGATTATCT TAATCAGAAA TATGGAAATC 1560
PDY SAY CQKI DYL NQK YGNR

GATTTAAAAA AGGAATTGAA ATCGGTTATT TTAAAGATAG GGAATCAGAT ATTTTAGATT 1620
FKK GIE IGYF KDR ESD ILDY

ATTTAAAAAA TAAAGAATTT GATTTAAAAC TATTGTCAAT CCATCATAAT GGTAGGTATG 1680
LKN KEF DLKL LSI HHN GRYD

ATTATCTGCA AGAAGAAGCT CTGAAAGTAC CAACAAAGGG AGCTTTTAGC AGATTACTTT 1740
YLQ EEA LKVP TKG AFS RLL .

AATCGTATGG AATTTGCCAT AGGCCGTGTG GAAGCGCACG TTTTAGCTCA CTTTGATTAT 1800

GGTTTTCGTA AGTTAAACTT AGATGTAGAA GATTTAAAAC CGTTTGAAAC GCAATTGAAG 1860

CGCATTTTCA TAAAGATGTT ATCTAAGGGG TTAGCTTTTG AACTAAATAC CAAATCCCTT 1920
TATCTATATG GGAATGAAAA ACTTTATCGC TATGCTTTAG AGATACTCAA ACAGCTTGGT 1980

TGTAAACAAT ACTCTATAGG CTCTGACGGT CATATTCCTG AACATTTTTG TTATGAATTT 2040

GATAGACTTC AAGGTCTGCT AAAGGACTAT CAAATTGATG AAAATCATTT GATATGAGGA 2100

AATTTTTGAT AAAAAAGCTA GGCAATATTG CTTAGCTTTT TTGTAATGCT ATTGATAGTT 2160

TTAGTGAAAA TTTCAAAAAA ATAAAGAAAT CATTTACTTG TTGCAAGCGC TTGCGTAAAT 2220

TGTTATGATT TTATTGGTAA CAATTCATTA AAAAAGGAGA ATGATATGAA AAGAAAAGAC 2280
                                                 MK RKD

TTATTTGGTG ATAAACAAAC TCAATACACG ATTAGAAAGT TAAGTGTTGG AGTAGCTTCA 2340
LFGD KQT QYT IRKL SVG VAS

GTTACAACAG GGGTATGTAT TTTTCTTCAT AGTCCACAGG TATTTGCTGA AGAAGTAAGT 2400
VTTG VCI FLH SPQV FAE EVS

GTTTCTCCTG CAACTACAGC GATTGCAGAG TCGAATATTA ATCAGGTTGA CAACCAACAA 2460
VSPA TTA IAE SNIN QVD NQQ

TCTACTAATT TAAAAGATGA CATAAACTCA AACTCTGAGA CGGTTGTGAC ACCCTCAGAT 2520
STNL KDD INS NSET VVT PSD

ATGCCGGATA CCAAGCAATT AGTATCAGAT GAAACTGACA CTCAAAAGGG AGTGACAGAG 2580
MPDT KQL VSD ETDT QKG VTE

CCGGATAAGG CGACAAGCCT GCTTGAAGAA AATAAAGGTC CTGTTTCAGA TAAAAATACC 2640
PDKA TSL LEE NKGP VSD KNT

TTAGATTTAA AAGTAGCACC ATCTACATTG CAAAATACTC CCGACAAAAC TTCTCAAGCT 2700
LDLK VAP STL QNTP DKT SQA

ATAGGTGCTC CAAGCCCTAC CTTGAAAGTA GCTAATCAAG CTCCACGGAT TGAAAATGGT 2760
IGAP SPT LKV ANQA PRI ENG

TACTTTAGGC TACATCTTAA AGAATTGCCT CAAGGTCATC CTGTAGAAAG CACTGGACTT 2820
YFRL HLK ELP QGHP VES TGL

TGGATATGGG GAGATGTTGA TCAACCGTCT AGTAATTGGC CAAATGGTGC TATCCCTATG 2880
WIWG DVD QPS SNWP NGA IPM

ACTGATGCTA AGAAAGATGA TTACGGTTAT TATGTTGATT TTAAATTATC TGAAAAACAA 2940
TDAK KDD YGY YVDF KLS EKQ

CGAAAACAAA TATCTTTTTT AATTAATAAC AAAGCAGGGA CAAATTTAAG CGGCGATCAT 3000
RKQI SFL INN KAGT NLS GDH

CATATTCCAT TATTACGACC TGAGATGAAC CAAGTTTGGA TTGATGAAAA GTACGGTATA 3060
HIPL LRP EMN QVWI DEK YGI

CATACTTATC AACCCCTCAA AGAAGGGTAT GTCCGTATTA ACTATTTGAG TTCCTCTAGT 3120
HTYQ PLK EGY VRIN YLS SSS

AACTATGACC ACTTATCAGC ATGGCTCTTT AAAGATGTTG CAACCCCYTC AACAACTTGG 3180
NYDH LSA WLF KDVA TPS TTW

CCAGATGGTA GTAATTTTGT GAATCAAGGA CTATATGGAA GGTATATTGA TGTATCACTA 3240
PDGS NFV NQG LYGR YID VSL
AAAACTAACG CCAAAGAGAT TGGTTTTCTA ATCTTAGATG AAAGTAAGAC AGGAGATGCA 3300
KTNA KEI GFL ILDE SKT GDA

GTGAAAGTTC AACCCAACGA CTATGTTTTT AGAGATTTAG CTAACCATAA CCAAATTTTT 3360
VKVQ PND YVF RDLA NHN QIF

GTAAAAGATA AGGATCCAAA GGTTTATAAT AATCCTTATT ACATTGATCA AGTGCAGCTA 3420
VKDK DPK VYN NPYY IDQ VQL

AAGGATGCCC AACAAATTGA TTTAACAAGT ATTCAAGCAA GTTTTACAAC TCTAGATGGG 3480
KDAQ QID LTS IQAS FTT LDG

GTAGATAAAA CTGAAATTTT AAAAGAATTG AAAGTGACTG ATAAAAATCA AAATGCTATA 3540
VDKT EIL KEL KVTD KNQ NAI

CAAATTTCTG ATATCACTCT CGATACTAGT AAATCTCTTT TAATAATCAA AGGCGACTTT 3600
QISD ITL DTS KSLL IIK GDF

AATCCTAAAC AAGGTCATTT CAACATATCT TATAATGGTA ACAATGTCAT GACAAGGCAA 3660
NPKQ GHF NIS YNGN NVM TRQ

TCTTGGGAAT TTAAAGACCA ACTTTATGCT TATAGTGGAA ATTTAGGTGC AGTTCTCAAT 3720
SWEF KDQ LYA YSGN LGA VLN

CAAGATGGTT CAAAAGTTGA AGCCAGCCTC TGGTCACCGA GTGCTGATAG TGTCACTATG 3780
QDGS KVE ASL WSPS ADS VTM

ATTATTTATG ACAAAGATAA CCAAAACAGG GTTGTAGCGA CTACCCCCCT TGTGAAAAAT 3840
IIYD KDN QNR VVAT TPL VKN

AATAAAGGTG TTTGGCAGAC GATACTTGAT ACTAAATTAG GTATTAAAAA CTATACTGGT 3900
NKGV WQT ILD TKLG IKN YTG

TACTATTATC TTTACGAAAT AAAAAGAGGT AAGGATAAGG TTAAGATTTT AGATCCTTAT 3960
YYYL YEI KRG KDKV KIL DPY

GCAAAGTCAT TAGCAGAGTG GGATAGTAAT ACTGTTAATG ATGATATTAA AACGGCTAAA 4020
AKSL AEW DSN TVND DIK TAK

GCAGCTTTTG TAAATCCAAG TCAACTTGGA CCTCAAAATT TAAGTTTTGC TAAAATTGCT 4080
AAFV NPS QLG PQNL SFA KIA

AATTTTAAAG GAAGACAAGA TGCTGTTATA TACGAAGCAC ATGTAAGAGA CTTCACTTCT 4140
NFKG RQD AVI YEAH VRD FTS

GATCGATCTT TGGATGGAAA ATTAAAAAAT CAATTTGGTA CCTTTGCAGC CTTTTCAGAG 4200
DRSL DGK LKN QFGT FAA FSE

AAACTAGATT ATTTACAGAA ATTAGGAGTT ACACACATTC AGCTTTTACC GGTATTGAGT 4260
KLDY LQK LGV THIQ LLP VLS

TATTTTTATG TTAATGAAAT GGATAAGTCA CGCTCAACAG CTTACACTTC CTCAGACAAT 4320
YFYV NEM DKS RSTA YTS SDN

AATTACAATT GGGGCTATGA CCCACAGAGC TATTTTGCTC TTTCTGGGAT GTATTCAGAG 4380
NYNW GYD PQS YFAL SGM YSE

AAACCAAAAG ATCCATCAGC ACGTATCGCC GAATTAAAAC AATTAATACA TGATATTCAT 4440
KPKD PSA RIA ELKQ LIH DIH
AAACGTGGCA TGGGGGTTAT ACTTGATGTC GTCTATAATC ACACTGCAAA AACTTATCTC 4500
KRGM GVI LDV VYNH TAK TYL

TTTGAGGATA TAGAACCTAA TTATTATCAC TTTATGAATG AAGATGGTTC ACCAAGAGAA 4560
FEDI EPN YYH FMNE DGS PRE

AGTTTTGGAG GGGGACGTTT AGGAACCACT CATGCAATGA GTCGTCGTGT TTTGGTTGAT 4620
SFGG GRL GTT HAMS RRV LVD

TCCATTAAAT ATCTTACAAG TGAATTTAAA GTTGATGGTT TCCGTTTTGA TATGATGGGA 4680
SIKY LTS EFK VDGF RFD MMG

GATCATGATG CGGCTGCGAT TGAATTAGCT TATAAAGAAG CTAAAGCTAT TAATCCTAAT 4740
DHDA AAI ELA YKEA KAI NPN

ATGATTATGA TTGGTGAGGG CTGGAGAACA TTCCAAGGCG ATCAAGGTCA GCCGGTTAAA 4800
MIMI GEG WRT FQGD QGQ PVK

CCAGCTGACC AAGATTGGAT GAAGTCAACC GATACAGTTG GCGTCTTTTC AGATGATATT 4860
PADQ DWM KST DTVG VFS DDI

CGTAATAGCT TGAAATCTGG TTTTCCAAAT GAAGGTACTC CAGCTTTCAT CACAGGTGGC 4920
RNSL KSG FPN EGTP AFI TGG

CCACAATCTT TACAAGGTAT TTTTAAAAAT ATCAAAGCAC AACCTGGGAA TTTTGAAGCA 4980
PQSL QGI FKN IKAQ PGN FEA

GATTCGCCAG GAGATGTGGT GCAGTATATT GCTGCACATG ATAACCTTAC CTTGCATGAT 5040
DSPG DVV QYI AAHD NLT LHD

GTGATTGCAA AATCAATT (SEQ ID NO:22) 5058
VIAK SI







FIG. 4a 






NLKAELSVED EQYTATVYGK SAHGSTPQEG VNGATYLALY LSQFDFEGPA 50

RAFLDVTANI IHEDFSGEKL GVAYEDDCMG PLSMNAGVFQ FDETNDDNTI 100

ALNFRYPQGT DAKTIQTKLE KLNGVEKVTL SDHEHTPHYV PMDDELVSTL 150

LAVYEKQTGL KGHEQVIGGG TFGRLLERGV AYGAMFPGDE NTMHQANEYM 200

PLENIFRSAA IYAEAIYELI K (SEQ ID NO:23) 221






FIG. 4b 






MTDLEKIIKA IKSDSQNQNY TENGIDPLFA APKTARINIV GQAPGLKTQE 50

ARLYWKDKSG DRLRQWLGVD EETFYHSGKF AVLPLDFYYP GKGKSGDLPP 100

RKGFAEKWHP LILKEMPNVQ LTLLVGQYAQ KYYLGSSAHK NLTETVKAYK 150

DYLPDYLPLV HPSPRNQIWL KKNPWFEKDL IVDLQKIVAD ILKD (SEQ ID NO: 24) 194












FIG. 4c 
MRDNHLHTYF SYDCQTAFED YINGFTGEFI TTEHFDLSNP YTGQDDVPDY 50

SAYCQKIDYL NQKYGNRFKK GIEIGYFKDR ESDILDYLKN KEFDLKLLSI 100

HHNGRYDYLQ EEALKVPTKG AFSRLL (SEQ ID NO: 25) 126

FIG. 4d 



MKRKDLFGDK QTQYTIRKLS VGVASVTTGV CIFLHSPQVF AEEVSVSPAT 50 

TAIAESNINQ VDNQQSTNLK DDINSNSETV VTPSDMPDTK QLVSDETDTQ 100 

KGVTEPDKAT SLLEENKGPV SDKNTLDLKV APSTLQNTPD KTSQAIGAPS 150 

PTLKVANQAP RIENGYFRLH LKELPQGHPV ESTGLWIWGD VDQPSSNWPN 200

GAIPMTDAKK DDYGYYVDFK LSEKQRKQIS FLINNKAGTN LSGDHHIPLL 250 

RPEMNQVWID EKYGIHTYQP LKEGYVRINY LSSSSNYDHL SAWLFKDVAT 300 

PSTTWPDGSN FVNQGLYGRY IDVSLKTNAK EIGFLILDES KTGDAVKVQP 350 

NDYVFRDLAN HNQIFVKDKD PKVYNNPYYI DQVQLKDAQQ IDLTSIQASF 400 

TTLDGVDKTE ILKELKVTDK NQNAIQISDI TLDTSKSLLI IKGDFNPKQG 450 

HFNISYNGNN VMTRQSWEFK DQLYAYSGNL GAVLNQDGSK VEASLWSPSA 500 

DSVTMIIYDK DNQNRVVATT PLVKNNKGVW QTILDTKLGI KNYTGYYYLY 550 

EIKRGKDKVK ILDPYAKSLA EWDSNTVNDD IKTAKAAFVN PSQLGPQNLS 600 

FAKIANFKGR QDAVIYEAHV RDFTSDRSLD GKLKNQFGTF AAFSEKLDYL 650 

QKLGVTHIQL LPVLSYFYVN EMDKSRSTAY TSSDNNYNWG YDPQSYFALS 700 

GMYSEKPKDP SARIAELKQL IHDIHKRGMG VILDVVYNHT AKTYLFEDIE 750 

PNYYHFMNED GSPRESFGGG RLGTTHAMSR RVLVDSIKYL TSEFKVDGFR 800

FDMMGDHDAA AIELAYKEAK AINPNMIMIG EGWRTFQGDQ GQPVKPADQD 850 

WMKSTDTVGV FSDDIRNSLK SGFPNEGTPA FITGGPQSLQ GIFKNIKAQP 900 

GNFEADSPGD VVQYIAAHDN LTLHDVIAKS I (SEQ ID NO: 26) 931 

FIG. 4e 
AATTCAAAGT TTGACAGAAG GTCAACTTCG TTCTGATATC CCTGAGTTCC GTGCTGGTGA 60
IQS LTEG QLR SDI PEFR AGD

TACTGTACGT GTTCACGCTA AAGTTGTTGA AGGTACTCGC GAACGTATTC AGATCTTTGA 120
TVR VHAK VVE GTR ERIQ IFE

AGGTGTTGTT ATCTCACGTA AAGGTCAAGG AATCTCAGAA ATGTACACAG TACGTAAAAT 180
GVV ISRK GQG ISE MYTV RKI

TTCTGGTGGT ATCGGTGTAG AGCGTACATT CCCAATTCAC ACTCCTCGTG TTGATAAAAT 240
SGG IGVE RTF PIH TPRV DKI

CGAAGTTGTT CGTTATGGTA AAGTACGTCG TGCTAAACTT TACTACTTAC GCGCATTGCA 300
EVV RYGK VRR AKL YYLR ALQ

AGGTAAAGCT GCACGTATTA AAGAAATCCG TCGTTAATTT TGATGATCAG ATTTTAAAAA 360

TGCTTGGTTG TTTGAGGATA GTAACTATGT TTTAAAACTG GACAACCAAG ACGTAAAAAA 420

TCTGCCTGTG GGCAGTTTTT TTACTAGGTC CCCTTAGTTC AATGGATATA ACAACTCCCT 480
                                           . HIY CSG

CCTAAGGAGT AATTGCTGGT TCGATTCCGG CAGGGGACAT ATTCATTGCA TGTAAATAGC 540
GLS YNST RNR CPV YENC TFL

GGTTTAGAGC TATTTTGCCC CAAATTTCTC TGATTAAGTT TATCGTTCCT ATCTTTTTGT 600
PKS SNQG LNR QNL KDNR DKQ

TCTTGTAATT GATGTGCGTA AACTTCTAAA GTGATATTTA AATTCTCGTG ATCTAAAACT 660
EQL QHAY VEL TIN LNEH DLV

TGAGAGATGG AAATTAGATA GCTTGCAAAT GTATGCCTGA GAGAGTGCAC TCGTACCTCG 720
QSI SILY SAF THR LSHV RVE

CGACCAGTTA TTTTTCGGAT AGTTTTATTG ACTGCATTAT TTGAAAGTTT GTCGAATAAT 780
RGT IKRI TKN VAN NSLK DFL

CTGTCGTTTT TATTTTTTGT AAATTCATGC AAAAAAAATA ATGTATCATT GTCAATTGGT 840
RDN KNKT FEH LFF LTDN DIP

ATATTTCTGA TACTACTTTT GTTTTTTGTT GGCAGGTATC TTTGGTTGAA ATGATAATCC 900
INR ISSK NKT PLY RQNF HYD

CAAGTTTTAT TAATTGATAA ATATTTGTTA GTGTAATCAA TATCATTAAC TGTTAAACCT 960
WTK NISL YKN TYD IDNV TLG

AAACATTCAG CGAAGCGCAT GCCAGTTTTA GCGATGAGGT ATAACGCTGC ATACGATTGA 1020
LCE AFRM
     <--|

TGTTGTGATT TTTCTTTACA AATTTTTATC AAGCGTAAGT ATTCATTGGT TTCAAGAAAT 1080

TTTATCTCTA TTTACGCCCC TTATTTTTTG CTTTAACCTT AGTGAATAAA CAAAAATTTT 1140

TTTCTATATA TCCCTCGTGA ACAGCCATGG ATACGCAGGC TTTTACATGT ATGTTAAAAC 1200

GCTTTACTGT ATCTTGCACA TGCGTTTGAC TATAATGATT TATGACTTGT TGATATTTAG 1260
TGGAAGTAAT ATTGCAAAGT AATATATTTC CTATTATATG TTTATACGAT ATTCGATATT 1320

CCCACCCGTT GTCGCGTTTA CGGAAATACG CCATTGATAT ACTCCACATT AGCTAAAGAA 1380

CAGGGTGTTC AAGGCTACCT TGATGGAAAA GGCTCTCTTA GAGATATTTG TAAATGGTAT 1440

GATATCTCAA GTCGCTCTGT TCTCCAAAAG TGGATAAAAC GGTATACTAG TGGTGAAGAC 1500

TTGAAAGCCA CTAGTAGAGG ATATAGCCGT ATGAAACAAG GAAGGCAAGC CACATTTGAA 1560

GAACGTGTAG AGATTGTTAA CTACACCATT GCCCATGGGA AAGACTATCA AGCAGCTATT 1620

GAGAAGTTTG GTGTTTCCTA CCAACAAATT TATTCTTGGG TGCGTAAGCT TGAGAAGAAT 1680

GGCTCACAAG GTTTGGTTGA TAGACGTGTG AAAGGGTTGG AGAGTAGGCC TGATTTAACC 1740

GAGATTGAGC AACTTTAACT CAAGATTAAA CAATTGGAGG AACGTAATCG TCTCTTAGAA 1800

ATCGAGGTTA GTTTACTAAA AAAGTTAGAA GACATCAAAC GAGGAAACAG ACGGTAAGAC 1860

TAGGTAAGCA TTTAGCGGAG TTCCAAGTAA TCAAGAATTA TTACGATGAG GAATCTAATG 1920

TGCCTATTCA GGCCTTATGC CAACTCTTGA AGGGGTCTCG TTCAGGCTAT TACAAGTGGC 1980

TCAATCGTCA AAAAACAGAT TTTGAGACAA AAAATACAAA GCTAATGGCT AAAATCAAGG 2040

AACTTCGTAG ACTCTACAAT GGTATCTTAG GTTATCGCCG TATGACAACA TTTATTAATC 2100

GTCAACTTGG GACAACTTAA AACAAGAAAC GGATTCGTTG ATTGATGAAC ATTCTGGGGA 2160

TTAGTTCAGT CATTCGTCGT GTTAGCCATG CTTGTACAAA AGCTGGTGAC AGATTTTACG 2220

AAGAAAATAT TCTTAATCGT GAATTTACAG CCACAGCTCA TAACCAGAAA TGGTGCACAG 2280

ATGTCACCTA TCTTCAATAC GGTCTGGGAG CTAAAGCTTA TCTCAGTGCG ATTAAAGACC 2340

TGTATAACGG TTCTATTATC GCTTATGAGA TTAGTCACAA CAATGAAATC CACTTGTTAT 2400

GAAGACCATT AAAAAGGGGC TAGAGCTCAA TCCAGGAGCC ACACCTATCA TCCATAGCGA 2460

TTGAGGTAGT CAATATACTT CCAAAGAATA CCGTTATATC ATACAACAAG CTGGTCTGAC 2520

CTTATCCATG TCCCGGATTG GCAAATGTAT TGATAATGCA CCAACTGAAA GTTTCTTTGG 2580

GTTTTTCAAG ACTGAGTCTT ACCACCTTAA GAAATACAAC TCTTATGATG AGTTGGTCAA 2640

TGATGTGGCA CGTTATATCG AATTCTACAA CACACAACGT TATCAATCAA AATTAAACAA 2700

CCTGACTCCT CTAGAATTCA GGAATCAGGT TGCATAACTT ATCTTTTATT ATTTGACTGT 2760

CTACTTGACA GGGAGCCGTT CAGATTGCTT AACCTTTCTA AATTTGCTAA AATAGCTACA 2820

AGAAAACGAG CCATTTAATG CTTATTTCTT ATACTGTCTT GCCTCACGCT CTCCTCGACC 2880

AAAAATTGAG CGTGAGGCTT TTTGTTTCAT TAAACGATGA TATTTCCATA TTCATCAGTT 2940

TGTTTTCCGA GAGCCATCAA AGCTTCGATA AGGTCGATAA TTCCAGGAAT AAAGGTAATA 3000

CTAAAAATAA TATATAAAAA AACCTGGCCT ATTTTTCCTG CGTAAAATTT ATGCGCTCCA 3060

ATGCCGCCCA AAAGAACGTT AATAAAACAT AAACTACTAT GTTAGCATAA GACTTTATTT 3120






WO 99/42588 



PCT/CA99/00114 



TTACAACTGA ATTTCATATA AATGGATTAG AGTAAGGGAT AAAAGAAATT AGCATAGCTC 3180

TTTTGAAAAT AAAAAAATTA ATATAATATG GAAAAAATTT TATTTCATAA ACGTTTCATA 3240

AAAGGTATGT AATCTAGTAT TTAGGCAACA CTATTTTGTC ACTGGTGTCT AGTAACTTAT 3300 

AGATTGATAA TTTTACTAGT AAACGTAATT CTTCGCTTTA AGAGTTAAAT GTCTATTTAT 3360 

TGTAAGCTAA ATTGGGAGGT GAACTTATGT AAAATTAGAT AGGTACTGTC AAGTACGGGA 3420

TGATTATTGA AACAGCCAGT ATGCATCATA AAATCTGTAT TGCTTAATAA CTATTTCCTT 3480

AACCAGACAT CAGTTCATTG TTTATCATCG CTACCCTAAG TCTAGTTTTT TCAATAGAGC 3540 

ATTAGGTAGT TTTTGATAAT AAAACTATAT AAACATGAGA ATTAGATTTC GTATTGCATT 3600

CTTCATAATG AGTTATTTGA GATTTTCCTT TGAATAAATA GATACGAAAT TCAGTAACTT 3660 

CATATATAAA CGGCTCTATC ATTGAGATAG TTTGTCAAAT GAAGAAATTT TTAATGGAAA 3720 

TAGTTTTAAA AACATTAGTT GTAGGCGATG TAAAAATATT AATCCAGTGG ATGCAATAGT 3780 

TGCGGAGTAA AAATAGAGAG GAGTAATTAG GAAGTGATAA AAAATGCTAT AGCATATATT 3840 

ACCAGAAAAA AAAATAGAAC ACTTATTATA TTTGCTATTT TAACAATTGT TCTTTCTTGC 3900 

TTGTATTCAT GTTTAACAAT AATGAAATCA AGTAATGAAA TAGAAAAGGC TTTATATGAA 3960 

MKS SNEI EKA LYE
I > 

AGTTCTAATT CTTCAATATC AATTACAAAA AAAGATGGTA AATATTTTAA TATTAATCAA 4020
SSNS SIS ITK KDGK YFN INQ

TTTAAGAATA TTGAAAAAAT AAAAGAGGTT GAAGAAAAAA TATTTCAATA TGATGGATTA 4080
FKNI EKI KEV EEKI FQY DGL 

GCAAAATTGA AAGATCTTAA AGTAGTTAGT GGTGAGCAAA GTATAAATAG AGAAGATTTA 4140
AKLK DLK VVS GEQS INR EDL

TCTGACGAAT TTAAAAATGT TGTTTCACTA GAAGCTACAA GTAATACTAA AAGAAATCTT 4200 
SDEF KNV VSL EATS NTK RNL

TTATTTAGTA GTGGAGTATT TAGTTTTAAA GAAGGAAAAA ATATAGAAGA AAATGATAAG 4260 
LFSS GVF SFK EGKN IEE NDK 

AATTCAATTC TTGTTCATGA AGAATTTGCT AAACAAAACA AACTAAAATT GGGTGATGAA 4320
NSIL VHE EFA KQNK LKL GDE 

ATTGATCTTG AATTACTAGA TACGGAAAAA AGTGGAAAAA TAAAAAGTCA TAAATTTAAA 4380
IDLE LLD TEK SGKI KSH KFK

ATTATAGGAA TCTTTTCTGG TAAAAAACAG GAAACATATA CAGGATTATC ATCTGATTTT 4440
IIGI FSG KKQ ETYT GLS SDF

AGCGAAAATA TGGTTTTTGT AGATTATTCA ACTAGCCAAG AAATATTAAA TAAATCAGAG 4500
SENM VFV DYS TSQE ILN KSE

AATAATAGAA TTGCAAATAA AATTTTAATG TATTCTGGTA GTTTAGAATC TACAGAGCTT 4560
NNRI ANK ILM YSGS LES TEL

GCCTTAAACA AATTGAAAGA CTTTAAAATT GATAAGTCAA AGTATTCTAT TAAGAAAGAT 4620
ALNK LKD FKI DKSK YSI KKD

AATAAAGCAT TCGAAGAGTC TTTAGAGTCA GTGAGTGGAA TAAAACATAT AATTAAAATA 4680
NKAF EES LES VSGI KHI IKI 

ATGACTTATT CGATTATGTT AGGTGGAATA GTTGTTCTTT CATTAATCTT GATTCTATGG 4740 
MTYS IML GGI VVLS LIL ILW

TTAAGAGAAA GAATTTATGA AATAGGTATA TTTTTATCTA TTGGAACAAC TAAGATACAA 4800
LRER IYE IGI FLSI GTT KIQ

ATTATAAGGC AATTTATATT TGAGTTAATA TTCATATCAA TACCAAGTAT AATATCCTCC 4860
IIRQ FIF ELI FISI PSI ISS

TTATTTTTAG GGAATCTACT ATTAAAAGTA ATTGTAGAAG GATTTATTAA CTCAGAGAAC 4920
LFLG NLL LKV IVEG FIN SEN

TCAATGATTT TCGGTGGAAG TTTAATAAAT AAAAGCAGTT TTATGTTAAA CATAACAACA 4980
SMIF GGS LIN KSSF MLN ITT 

CTTGCAGAAA GTTATTTAAT ATTAATAAGT ATTATTGTTT TATCAGTTGT AATGGCCTCT 5040 
LAES YLI LIS IIVL SVV MAS

TCATTAATAT TATTTAAGAA ACCACAAGAA ATATTATCAA AAATAAGTTA GGAGCAAATA 5100 
SLIL FKK PQE ILSK IS. 

ATGGATATAT TAGAAATAAA GAATGTAAAT TACAGTTACG CAAATTCTAA AGAAAAAGTT 5160
MDIL EIK NVN YSYA NSK EKV
I > 

TTGTCAGGAG TAAATCAAAA ATTTGAACTT GGAAAGTTTT ATGCGATAGT AGGGAAGTCA 5220
LSGV NQK FEL GKFY AIV GKS 

GGAACAGGAA AATCCACACT TCTTTCCTTA CTTGCAGGAC TTGATAAAGT TCAAACAGGA 5280 
GTGK STL LSL LAGL DKV QTG 

AAAATCTTGT TTAAGAATGA AGATATAGAA AAGAAAGGAT ATAGTAATCA CAGAAAAAAT 5340 
KILF KNE DIE KKGY SNH RKN 

AATATATCTT TGGTATTTCA AAATTATAAT TTAATAGATT ATTTATCGCC GATTGAAAAT 5400 
NISL VFQ NYN LIDY LSP IEN 

ATTAGACTAG TAAATAAATC AGTAGATGAG AGTATCTTGT TCGAATTAGG TTTAGATAAA 5460
IRLV NKS VDE SILF ELG LDK 

AAACAAATAA AAAGAAATGT TATGAAATTA TCTGGTGGTC AGCAACAAAG GGTAGCTATT 5520 
KQIK RNV MKL SGGQ QQR VAI 

GCTAGGGCAC TGGTATCAGA TGCCCCAATA ATACTAGCTG ATGAGCCTAC CGGTAACCTA 5580 
ARAL VSD API ILAD EPT GNL 

GACAGTGTTA CTGCTGGAGA AATAATT (SEQ ID NO:27) 5607
DSVTAGEII. 



FIG. 5a 
IQSLTEGQLR SDIPEFRAGD TVRVHAKVVE GTRERIQIFE GVVISRKGQG 50

ISEMYTVRKI SGGIGVERTF PIHTPRVDKI EVVRYGKVRR AKLYYLRALQ 100 

GKAARIKEIR R (SEQ ID NO:28) 111

FIG. 5b



MRFAECLGLT 


VNDIDYTNKY 


LSINKTWDYH 


FNQRYLPTKN 


r\b b 1KN 1 PI D 


50 


NDTLFFLHEF 


TKNKNDRLFD 


KLSNNAVNKT 


IRKITGREVR 


VrlbLiKril FAS 


100 


YLISISQVLD 


HENLNITLEV 



YAHQLQEQKD 


RNDKLNQRNL 


fAXT CC7DT TTTn 


150 


CNEYVPCRNR 


TSNYSLGGST 


YIH (SEQ ID NO:29)




173 






FIG. 5c 






MKSSNEIEKA 


LYESSNSSIS 


ITKKDGKYFN 


INQFKNIEKI 


KEVEEKIFQY 


50 


DGLAKLKDLK 


VVSGEQSINR


EDLSDEFKNV 


VSLEATSNTK 


RNLLFSSGVF 


100 


SFKEGKNIEE 


NDKNSILVHE


EFAKQNKLKL 


GDEIDLELLD 


TEKSGKIKSH 


150 


KFKIIGIFSG


KKQETYTGLS


SDFSENMVFV


DYSTSQEILN 


KSENNRIANK 


200 


ILMYSGSLES


TELALNKLKD 


FKIDKSKYSI 


KKDNKAFEES 


LESVSGIKHI 


250 


IKIMTYSIML 


GGIVVLSLIL 


ILWLRERIYE 


IGIFLSIGTT 


KIQIIRQFIF 


300 


ELIFISIPSI 


ISSLFLGNLL 


LKVIVEGFIN 


SENSMIFGGS 


LINKSSFMLN 


350 


ITTLAESYLI 


LISIIVLSVV 


MASSLILFKK 


PQEILSKIS 




389 


(SEQ ID NO:30)
MDILEIKNVN


YSYANSKEKV 


LSGVNQKFEL 


GKFYAIVGKS 


GTGKSTLLSL 


50 


LAGLDKVQTG 


KILFKNEDIE 


KKGYSNHRKN 


NISLVFQNYN 


LIDYLSPIEN 


100 


IRLVNKSVDE 


SILFELGLDK 


KQIKRNVMKL 


SGGQQQRVAI 


ARALVSDAPI 


150 


ILADEPTGNL 


DSVTAGEII 


(SEQ ID NO:31)




169 



FIG. 5e 
CATATGACAA


TATTTTTCAA 


AGTCTACATC 


ACTTACTCGC 


CTGTCGTGGA 


AAATCTGGCA 


60 


ATACATTAAT 


CGACCAATTA 


GTTGCTGATG 


GTTTACTTCA 


TGCAGATAAT


CACTACCATT 


120 


TTTTCAATGG 


GAAGTCTCTG 


GCCACTTTCA 


ATACTAACCA 


ATTGATTCGC 


GAAGTTGTCT 


180 


ATGTTGAAAT 


ATCCTTAGAT 


ACTATGTCTA 


GTGGTGAACA 


TGATTTAGTA 


AAAGTTAACA 


240 


TTATCAGACC 


CACTACCGAG 


CATACTATCC 


CCACGATGAT 


GACAGCTAGC 


CCCTATCATC 


300 


AAGGTATCAA 


TGATCCTGCC 


GCAGACCAAA 


AAACATACCA 


AATGGAGGGT 


GCGCTAGCAG 


360 


TTAAACAGCC 


TAAACACATA 


CAAGTTGACA 


CAAAACCATT 


TAAAGAAGAA 


GTAAAACATC 


420 


CTTCAAAATT 


ACCCATCAGC 


CCTGCAACTG 


AAAGCTTCAC 


ACACATTGAC


AGTTATAGTC 


480 


TCAATGACTA 


TTTTCTTTCT 


CGTGGTTTTG 


CTAATATATA 


CGTTTCAGGT 


GTGGGTACTG 


540 


CTGGCTCTAC 


GGGTTTCATG 


ACCAGTGGGG 


ATTACCAACA 


AATACAAAGC 


TTTAAAGCAG 


600 


TCATTGATTG 


GTTAAATGGT 


AAGGTTACTG 


CATTCACAAG 


TCATAAACGA 


GATAAACAAG 


660 


TCAAGGCTGA 


TTGGTCAAAC 


GGCCTTGTAG 


CAACCACAGG 


TAAATCTTAT 


CTCGGTACCA 


720 


TGTCAACTGG 


TTTAGCAACA 


ACTGGCGTTG 


AGGGGCTGAA 


AGTCATTATC


GCTGAAGCCG 


780 


CAATCTCCAC 


ATGGTATGAT 


TATTATCGAG 


AAAATGGGCT 


TGTGTGTAGT 


CCAGGCGGCT 


840 


ACCCCGGTGA 


AGATTTAGAC


GTTTTAACAG 


AATTAACATA 


CTCACGAAAC 


CTCTTAGCTG 


900 


GTGATTACAT 


CAAAAACAAC 


GATTGCTATC 


AAGCATTGTT 


AAATGAACAA 


TCAAAAGCAA 


960 


TTGACCGTCA 


AAGTGGGGAT 


TACAACCAAT 


ACTGGCATGA 


CCGTAATTAC 


CTAACTCACG 


1020 


TCAATAATGT 


CAAAAGTCGA 


GTAGTTTACA 


CTCATGGACT 


ACAGGATTGG 


AATGTTAAGC 


1080 


CAAGACATGT 


CTACAAAGTT 


TTCAATGCAT 


TGCCTCAAAC 


CATCAAAAAA 


CACCTTTTTT 


1140 


TACATCAAGG 


TCAACATGTG 


TATATGCATA 


ATTGGCAGTC 


GATTGATTTT 


CGTGAAAGCA 


1200 


TGAATGCCTT 


ACTAAGCCAA 


GAACTACTTG 


GCATTGACAA 


TCATTTCCAA 


TTAGAAGAGG 


1260 


TCATTTGGCA 


AGATAATACT 


ACTGAGCAAA


CTTGGCAAGT 


TTTAGATGCT 


TTCGGAGGAA 


1320 


ACCATCAAGA 


GCAAATTGGT 


TTAGGTGATA 


GTAAAAAACT 


TATTGATAAC 


CATTATGACA 


1380 


AAGAAGCCTT 


TGATACTTAT 


TGTAAAGACT 


TCAATGTGTT 


CAAAAATGAT 


CTTTTCAAGG 


1440 


GAAATAATAA 


AACCAATCAA 


ATCACTATTA 


ATCTTCCTCT 


AAAGAAAAAT 


TATCTCCTGA 


1500 


ATGGACAGTG 


CAAACTCCAT 


CTACGTGTTA 


AAACTAGTGA 


CAAAAAGGCC 


ATTTTATCAG 


1560 


CCCAAATCTT 


AGACTATGGT 


CCTAAAAAAC 


GATTCAAAGA 


TACACCAACC 


ATCAAATTCT 


1620 


TAAACAGCCT 


TGATAATGGT 


AAAAATTTTG 


CCAGAGAAGC 


TTTACGTGAA 


CTCCCGTTTA 


1680 


CTAAAGATCA 


TTATCGTGTC 


ATCAGTAAAG 


GTGTCTTGAA 


CCTTCAAAAT 


CGTACAGACT 


1740 


TACTTACAAT 


TGAGGCTATC 


GAGCCAGAAC 


AATGGTTTGA 


TATCGAGTTT 


AGCCTCCAAC 


1800 


CAAGTATATA 


TCAATTGAGT 


AAAGGTGATA ATCTAAGGAT 


TATCCTTTAT 


ACAACTGATT 


1860 


TTGAACATAC 


CATTCGAGAT 


AATGCTAGTT 


ACTCTATAAC 


AGTAGATTTG 


AGTCAATCTT 


1920 


ATTTAACTAT 


CCCAACTAAT 


CAAGGAAATT 


AACTTATGAA 


ACTTCTTACT 


AAAGAACGGT 


1980 


TTGATGATTC 


TCAACACTTT 


TGGTACCAGA 


TCAATTTATT 


ACAAGAGAGT


AACTTCGGAG 


2040 


CAGTTTTTGA 


CCATGATAAT 


AAAAACATTC


CACAGGTTGT 


TGCAACTATT 


GTTGATGATT 


2100 


TACAAGGTTC 


CGGAAGTTCG 


AATCATTTCT 


GGTATTTTGG 


CAATACTACT 


GATACTTCCA 


2160 


TCCTTATGAT 


TGCTCATTTA 


AATCGAAAAT 


TCTATATTCA 


GGTTAATTTA 


AAGGACTTTG 


2220 


ACTTTGCACT 


CAATTTAATA 


GCTATAAATA 


ATTGGAAGAG


TCTCCTCCAA 


ACTCAACTTG 


2280 


AAGCTCTAAA 


CGATACCCTA 


GCAATATTTC 


AATAAATAAG 


GTAGAATGGA 


GTGACAAAGC 


2340 


AACGCGAGGG 


AGACTGATTA 


ATGTCATCTT 


ATTGGAATAA 


CTATCCTGAA 


CTTAAAAAAA 


2400 
ATATTGATGA


AACCAATCAA 


CTAATTCAAG 


AAAGAATACA 


GGTCAGAAAT 


AAAGATATTG


2460


AAGCGGCGCT 


AAGCCAACTC 


ACAGCTGCGG 


GAGGAAAACA 


GCTCAGACCA 


GCATTCTTTT 


2520


ACCTTTTTTC 


TCAACTTGGT 


AATAAGGAGA 


ATCAAGATAC 


TCAGCAACTA 


AAGAAAATCG 


2580


CTGCTTCTTT 


AGAAATCCTT 


CACGTTGCTA 


CATTAATCCA 


TGATGATGTC 


ATTGATGAGT


2640

CACCACTAAG 


ACGTGGAAAT 


ATGACCATTC 


AAAGCAAGTT 


TGGCAAAGAC 


ATCGCAGTTT 


2700


ATACTGGGGA 


TTTACTTTTC 


ACAGTCTTTT 


TCGATCTTAT 


TTTAGAATCT 


ATGACTGATA 


2760


CACCATTTAT 


GAGGATTAAT 


GCAAAATCTA 


TGCGTAAAAT 


TCTCATGGGA 


G AAT T G GAP H 


2820


AGATGCACCT 


TCGTTACAAT 


CAACAACAAG 


GTATCCATCA


CTATTTACGT 


GCGATTTCAG


2880

GTAAGACAGC 


CGAACTCTTT 


AAATTAGCTA 


GCAAAGAAGG 


AGCTTACTTT 


GGTGGTGCAG 


2940


AGAAGGAGGT 


TGTTCGTCTA 


GCAGGCCATA 


TCGGCTTTAA 


CATTGGTATG 


ACATTCCAAA 


3000 


TTTTGGATGA 


TATCCTGGAT 


TATACTGCAG 


ATAAAAAAAC 


ATTTAATAAG 


CCTGTCTTAG


3060

AGGATTTAAC 


ACAAGGCGTT 


TACAGCCTTC 


CTCTACTTCT 


TGCCATTGAA 


GAAAATCCTG


3120

ATATTTTCAA 


ACCTATTTTA 


GATAAAAAAA 


CAGATATGGC 


TACTGAAGAC 


ATGGAAAAAA 


3180


TTGCTTATCT 


CGTCGTTTCC 


CATAGAGGTG 


TTGACAAAGC 


TCGCCATCTA 


GCTCGTAAAT


3240

TTACTGAGAA 


AGCTATTAGT 


GACATAAATA 


AGCTACCCCA 


GAACTCTGCA 


AAAAAAPAHT 


3300


TGCTACAATT 


AACTAATTAC 


CTTTTAAAAC 


GCAAAATTTA 


AATAATAAAA 


AAACATTCTA


3360


CAATGCTAGA 


AAAGCAGTTA 


GGGAATGTTT 


TTTTATTATC 


ATTTATTTAT


CGCACCTATP


3420


AATCATCATA 


GATCACCATC


ATCAGCGGCT 


TTCAGCTGAC 


GGTAACGTTG 


ACTACTTTGA


3480


GACAATTCTT 


GAGGAGAACC 


TTCCAACTCT 


AATTGCCCAT 


TTTCTATAAA 


TAAGATACGA


3540


TCAGCATGTT 


CAATACCTTT 


TAAGTGATGT 


GTAATCCAAA 


CTAAGGTCTT 


ACCTTCCAAT 

3600




TCTTTCATAA 


ATACCCTTAG 


TAAGGCTTGT 


TCAGTAATAG 


GATCAAGTCC 


AACAGTTGGC 


3660


TCATCTAAGA 


TAACAATTGG 


GACATCTTTT 


AGTAAGATTC 


TAGCCAAAGC 


AATTCTATGG


3720


CTTTCGCCAC 


CTGAAAACCT 


AAGTCCAGCT 


TCATCAACCA 


TTGTATAGAG 


ACCATCTGAT 


3780


AAATCAGTGA 


CCATCTCTTT 


CAATCCAACT 


CGTTCAAGAA 


CTTTCCATAC 



ATPTTPTTPA


3840


CTAGCATCTT 


GGTTTCCAAT 


GCGAATGTTA 


TTTAGCAGGG 


TTGTATTAAA 






GCTTGTTGTA 


TCACTCCAAT 


ATAGTTAGAA 


ATGCAATCAC 


CAACTATTGA 


AACATCAGCA 


3960 


CCGCCTAGGG 


TAATCTTCCC 


TTGACTTGCT 


TTCAAGTCGC 


CACGAAGTAG 


ACTAGCTAAG 


4020 


GTACTCTTGC 


CAGAACCACT 


CCGCCCTAAA 


ATAGCAATTT 


TTTCTCCTTC 


TTTAATATCC 


4080 


AAATCTAAAT 


GATGCAAAAC 


CCATTTCTCT 


TGTGGCTTAT 


ACTGGAAACT 


TAAATTCTTG 


4140 


ACGGAAAAAT


CATATGGCTT 


ATTAGGCAAT 


T (SEQ ID NO:32)




4171 
YDNIFQSLHH


LLACRGKSGN 


TLIDQLVADG


LLHADNHYHF 


FNGKSLATFN


50


TNQLIREVVY 


VEISLDTMSS 


GEHDLVKVNI 


IRPTTEHTIP


TMMTASPYHQ


100


GINDPAADQK 


TYQMEGALAV 


KQPKHIQVDT 


KPFKEEVKHP 


SKLPISPATE


150

SFTHIDSYSL 


NDYFLSRGFA 


NIYVSGVGTA 


GSTGFMTSGD


YQQIQSFKAV


200


IDWLNGKVTA 


FTSHKRDKQV 


KADWSNGLVA 


TTGKSYLGTM


STGLATTGVE


250


GLKVIIAEAA 


ISTWYDYYRE 


NGLVCSPGGY 


PGEDLDVLTE 


LTYSRNLLAG


300


DYIKNNDCYQ 


ALLNEQSKAI 


DRQSGDYNQY 


WHDRNYLTHV


NNVKSRVVYT


350


HGLQDWNVKP 


RHVYKVFNAL 


PQTIKKHLFL 


HQGQHVYMHN


WQSIDFRESM


400


NALLSQELLG 


IDNHFQLEEV 


IWQDNTTEQT


WQVLDAFGGN


HQEQIGLGDS


450

KKLIDNHYDK 


EAFDTYCKDF 


NVFKNDLFKG 


NNKTNQITIN 


LPLKKNYLLN 


500 


GQCKLHLRVK 


TSDKKAILSA 


QILDYGPKKR 


FKDTPTIKFL 


NSLDNGKNFA 


550 


REALRELPFT 


KDHYRVISKG 


VLNLQNRTDL 


LTIEAIEPEQ 


WFDIEFSLQP 


600 


SIYQLSKGDN


LRIILYTTDF 


EHTIRDNASY 


SITVDLSQSY 


LTIPTNQGN 


649 


(SEQ ID NO:33)











FIG. 6b 



MKLLTKERFD 


DSQHFWYQIN 


LLQESNFGAV FDHDNKNIPQ 


VVATIVDDLQ 


50 


GSGSSNHFWY 


FGNTTDTSIL 


MIAHLNRKFY IQVNLKDFDF 


ALNLIAINNW 


100 


KSLLQTQLEA 


LNDTLAIFQ 


(SEQ ID NO:34) 




119 






FIG. 6c 






MSSYWNNYPE 


LKKNIDETNQ 


LIQERIQVRN KDIEAALSQL 


TAAGGKQLRP 


50 


AFFYLFSQLG 


NKENQDTQQL 


KKIAASLEIL HVATLIHDDV 


IDDSPLRRGN 


100 


MTIQSKFGKD 


IAVYTGDLLF 


TVFFDLILES MTDTPFMRIN 


AKSMRKILMG 


150 


ELDQMHLRYN 


QQQGIHHYLR 


AISGKTAELF KLASKEGAYF 


GGAEKEVVRL 


200 


AGHIGFNIGM 


TFQILDDILD 


YTADKKTFNK PVLEDLTQGV


YSLPLLLAIE 


250 


ENPDIFKPIL 


DKKTDMATED


MEKIAYLVVS HRGVDKARHL 


ARKFTEKAIS 


300 


DINKLPQNSA 


KKQLLQLTNY 


LLKRKI (SEQ ID NO: 35) 




326 



FIG. 6d 
LPNKPYDFSV KNLSFQYKPQ EKWVLHHLDL DIKEGEKIAI LGRSGSGKST 50

LASLLRGDLK ASQGKITLGG ADVSIVGDCI SNYIGVIQQA PYLFNTTLLN 100 

NIRIGNQDAS EEDVWKVLER VGLKEMVTDL SDGLYTMVDE AGLRFSGGER 150 

HRIALARILL KDVPIVILDE PTVGLDPITE QALLRVFMKE LEGKTLVWIT 200 

HHLKGIEHAD RILFIENGQL ELEGSPQELS QSSQRYRQLK AADDGDL 247 
(SEQ ID NO:36) 

FIG. 6e 
AATTCTATTT


GGAGGTTTTT 


CTTGAATAAA


TGGTTAGTTA


AGGCAAGTTC


x x r\yj 1 lul 1 


60


TTAGGTGGTA 


TGGTTTTATC 


TGCGGGTTCC 


CGAGTTTTAG 


CGGATACTTA 




120


ATTGATAATG 


GTAGAATTAC 


AACAGGTTTC 


AATGGTTATC 


CTGGACATTG 


x uuuo X oor\x 


180 


TATGCTGTTC 


CGACTGGAAC 


GATTATTAGG 


GCAGTGGCAG 


ATGGTACTGT 


r?AAZ\TTTr , r , 7\ 

urtrtAl 1 x bLA 


240 


GGAGCTGGAG 


CCAACTTTTC 


TTGGATGACA 


GACTTAGCAG 


GAAATTGTGT 


PATPnTTPTiTx 


300 


CATGCGGATG 


GAATGCATAG 


TGGTTACGCT 


CATATGTCAC 


GTGTGGTGGC 




360 


GAAAAAGTCA 


AACAAGGAGA 


TATCATCGGT 


TACGTAGGAG 


CAACTGGTAT 




420


CCTCACCTTC 


ATTTTGAATT 


TTTACCAGCT 


AACCCTAATT 


TTCAAAATGG




480


CGTATCAATC 


CAACGTCACT 


AATTGCTAAC 


GTTGCGACCT 


TTAGTGGAAA 


AAPPPAAPpzy 


540


TCAGCTCCAA 


GCATTAAGCC 


ATTACAATCA


GCTCCTGTAC 


AGAATCAATC 


TAPT A A ATT 21 


600


AAAGTGTATC 


GAGTAGATGA 


ATTACAAAAG 


GTTAATGGTG 


TTTGGTTAGT 


PAAAAZxTZiZar' 


660


ACCCTAACGC 


CGACTGGGTT 


TGATTGGAAC 


GATAATGGTA 


TACCAGCATC


AGAAATTGAT


720


GAGGTTGATG 


CTAATGGTAA 


TTTGACAGCT 


GACCAGGTTC 


TTCAAAAAGG


1 1 111 


780


ATCTTTAATC 


CTAAAACTCT 


TAAGACTGTA 


GAAAAACCCA 


TCCAAGGAAC




840


ACTTGGGCTA 


AGACACGCTT 


TGCTAATGGT 


AGTTCAGTTT 


GGCTTCGPPT 


TPAPZizir , 2ir , T 


900


CAAGAACTGC 


TTTACAAATA 


GTTTGAGGTA 


TTGATTCATT 


GTTTTAAATG 



APAPTTTTPT 


960


TACTAACTAA 


GTACAATTTC 


TTTAAACCGT 


CTGAAAATAA 


TTTTATAGTP 

X X X X X X \— * 


PAPTZ\z\z\r , Tr' 




TGATATTATA


GTCTCGGACT 


AATAAAAAGG 


AAATAGGAAT 


TGAAGCAATG


AAAATGAATA


1080


AAAAGGTACT 


ATTGACATCG 


ACAATGGCAG 


CTTCGCTATT 


ATCAGTCGCA 


AGTGTTCAAG


1140


CACAAGAAAC 


AGATACGACG 


TGGACAGCAC 


GTACTGTTTC 


AGAGGTAAAG


GCTGATTTGG


1200


TAAAGCAAGA 


CAATAAATCA 


TCATATACTG 


TGAAATATGG 


TGATACACTA


AGCGTTATTT


1260


CAGAAGCAAT 


GTCAATTGAT 


ATGAATGTCT 


TAGCAAAAAT 


TAATAACATT


GCAGATATCA


1320

ATCTTATTTA 


TCCTGAGACA 


ACACTGACAG 


TAACTTACGA 


TCAGAAGAGT


CATACTGCCA


1380


CTTCAATGAA 


AATAGAAACA 


CCAGCAACAA 


ATGCTGCTGG 


TCAAACAACA


GCTACTGTGG


1440


ATTTGAAAAC 


CAATCAAGTT 


TCTGTTGCAG 


ACCAAAAAGT 


TTCTCTCAAT


ACAATTTCGG


1500


AAGGTATGAC 


ACCAGAAGCA


GCAACAACGA 


TTGTTTCGCC 


AATGAAGACA


TATTCTTCTG


1560


CGCCAGCTTT 


GAAATCAAAA 


GAAGTATTAG 


CACAAGAGCA 


AGCTGTTAGT 


CAAGCAGCAG


1620

CTAATGAACA 


GGTATCAACA 


GCTCCTGTGA 


AGTCGATTAC 


TTCAGAAGTT 


CCAGCAGCTA


1680


AAGAGGAAGT 


TAAACCAACT 


CAGACGTCAG 


TCAGTCAGTC 


AACAACAGTA 


TCACCAGCTT


1740


CTGTTGCCGC 


TGAAACACCA 


GCTCCAGTAG 


CTAAAGTAGC 


ACCGGTAAGA 


ACTGTAGCAG


1800


CCCCTAGAGT 


GGCAAGTGTT 


AAAGTAGTCA 


CTCCTAAAGT 


AGAAACTGGT


GCATCACCAG


1860


AGCATGTATC 


AGCTCCAGCA 


GTTCCTGTGA 


CTACGACTTC 


AACAGCTACA 


GACAGTAAGT


1920


TACAAGCGAC 


TGAAGTTAAG 


AGCGTTCCGG 


TAGCACAAAA AGCTCCAACA


GCAACACCGG


1980


TAGCACAACC 


AGCTTCAACA 


ACAAATGCAG


TAGCTGCACA 


TCCTGAAAAT 


GCAGGGCTCC


2040


AACCTCATGT 


TGCAGCTTAT 


AAAGAAAAAG 


TAGCGTCAAC 


TTATGGAGTT 


AATGAATTCA


2100 


GTACATACCG 


TGCAGGTGAT 


CCAGGTGATC 


ATGGTAAAGG 


TTTAGCAGTC 


GACTTTATTG 


2160 


TAGGTAAAAA 


CCAAGCACTT 


GGTAATGAAG 


TTGCACAGTA 


CTCTACACAA 


AATATGGCAG 


2220 


CAAATAACAT 


TTCATATGTT 


ATCTGGCAAC 


AAAAGTTTTA 


CTCAAATACA 


AATAGTATTT


2280 


ATGGACCTGC 


TAATACTTGG 


AATGCAATGC 


CAGATCGTGG 


TGGCGTTACT 


GCCAACCATT 


2340 


ATGACCATGT 


TCACGTATCA 


TTTAACAAAT 


AATATAAAAA AGGAAGCTAT 


TTGGCTTCTT 


2400 
TTTTATATGC

X X X X *i X X* X v-* 


CTTGAATAGA 


CTTTCAAGGT 


xv^x Inl 1 


1111 lnl li-V/i. 


Ai ICACCjAvjA 


2460


TTAAGCTATA 


AGTCTGAAAC 


TACTTTCACG 


TTAACCGTGA 


PTAAZiTr'ZiZXZi 


nLbl 1AAAAL 


2520


TAAAATCTAA 


GTCTGTAAAG 


ATTATTGAAA


ACGCTTTAAA 


AAPAP.ElTATLx 


AlAAbbl ilG 


2580 


TAGATATCTA 


AAATTAAAAA 


AGATAAGGAA 


PTftJACZlZiTZiT 
vj i o/-Ha/\ri 1 i-i. 1 


b^LAL/il CIA 


AGTAAAGAAG


2640 


CTTTTAAAAA 


GCAAATAAAA 

X *XiiJtX*i 


AATGGCATTA 

**** J- wVjwri j. X f\ 


1 1 vj x o 1 y^>i\ 1 \jj 


1 CH/ioC 1 1 iC 


CCTGGGGAGC 


2700 


CTCTTTATAC 


TGAAAGTGGA 


GGTGTTATGP 


tlLII 11 /iCC 


ill CCCAbC 1 


CAAGAAGCAG 


2760 


GAGCGGTTGG 


TATAAGAGCC


AATAGTGTPr 


lj^oi-lC/A.1 1 Pu\ 


CCAAA 1 1 LAA 


GAAGTTACTA 


2820 


ATTTACCTAT 


CATCGGCATT 


ATTAAACGTG 


rlM.X M.1 CC ICC 


ALAAbAAL LA 


rp rp rp 7\ rp /—» 71 _ 

I 1 1ATCACTG 


2880 


CTACGATGAC 


AGAGGTGGAT 


CAATTAGPTA 


will i-iori 1 i\ 1 


1 CCfiC li-Lf-ilii 


Pppfnm7\p7i rp rn 

bbbl IAGATT 


2940


GTACACTTAG


AGAGCGTCAT 


GATGGTTTGA 

vjn x v> w xxx Oil 


olbl nbL 1 


Oil 1A1 1 CAA 


A 7\ /"* 7\ rp 7\ TV TV TV 

AAbA 1 AAAAG 


3000 


GGAAATATCC 


TGAACAGTTG 


PTAATGGPTn 
J- x x o 


nlnl AAbi 1 AC 


1111 (jAAbxAA 


rr> T\ tv "A 7\ tv m /— * 

GGTAAAAATG 


3060 


CTTTTGAAGC 


AGGAGTTGAT 


TTTGTGGGTA 

X X 1VJ1 UVJO 1/1 


C/1/\C ILiAiL 


1 C(jA 1 AC ALA 


GATTACAGCC


3120 


GCCAAGAAGA 


AGGACCGGAT 


atap.a arTrr 




llbl L.AAGCC 


GGTATAGATG 


3180 


TGATTGCGGA 


AGGTAAAATT 


P7\ipj\ P rp /-^m T\ 

LA1AL 1 CC T A 


7\ /"^ TV t\ y**^ rrr tv tv 

AGCAAGCTAA 


TGAAATTAAT 


CATATAGGTG 


3240 


TTGCAGGAAT 


TGTAGTTGGT 


GGTGCTATCA 


CTAGACCAAA 


AGAAATAGCG 


GAGCGTTTCA 


3300 


TCTCAGGACT 


TAGTTAAAAG 


TGTTACTCAA 


AAATCAAAAT 


CAAAATAAAA 


AAGGGGAATA 


3360 


GTTATGAGTA 


TCAAAAAAAG 


TGTGATTGGT 


TTTTGCCTCG 


GAGCTGCAGC 


ATTATCAATG 


3420 


TTTGCTTGTG 


TAGACAGTAG 


TCAATCTGTT 


ATGGCTGCCG 


AGAAGGATAA 


AGTCGAAATT 


3480 


(SEQ ID NO:37) 
NSIWRFFLNK


WLVKASSLVV 


LGGMVLSAGS 


RVLADTYVRP 


IDNGRITTGF 


50 


NGYPGHCGVD 


YAVPTGTIIR 


AVADGTVKFA 


GAGANFSWMT 


DLAGNCVMIQ 


100 


HADGMHSGYA 


HMSRVVARTG 


EKVKQGDIIG 


YVGATGMATG 


PHLHFEFLPA 


150 


NPNFQNGFHG 


RINPTSLIAN 


VATFSGKTQA 


SAPSIKPLQS 


APVQNQSSKL 


200 


KVYRVDELQK 


VNGVWLVKNN 


TLTPTGFDWN 


DNGIPASEID 


EVDANGNLTA 


250 


DQVLQKGGYF 


IFNPKTLKTV 


EKPIQGTAGL 


TWAKTRFANG 


SSVWLRVDNS 


300 


QELLYK (SEQ ID NO: 38) 






306 
MKMNKKVLLT


STMAASLLSV


ASVQAQETDT


TWTARTVSEV


KADLVKQDNK


50


SSYTVKYGDT


LSVISEAMSI


DMNVLAKINN


IADINLIYPE


TTLTVTYDQK


100


SHTATSMKIE


TPATNAAGQT


TATVDLKTNQ


VSVADQKVSL 


NTISEGMTPE 


150 


AATTIVSPMK 


TYSSAPALKS 


KEVLAQEQAV 


SQAAANEQVS 


TAPVKSITSE 


200 


VPAAKEEVKP 


TQTSVSQSTT 


VSPASVAAET 


PAPVAKVAPV 


RTVAAPRVAS 


250 


VKVVTPKVET 


GASPEHVSAP 


AVPVTTTSTA 


TDSKLQATEV 


KSVPVAQKAP


300 


TATPVAQPAS 


TTNAVAAHPE 


NAGLQPHVAA 


YKEKVASTYG 


VNEFSTYRAG 


350 


DPGDHGKGLA 


VDFIVGKNQA 


LGNEVAQYST 


QNMAANNISY 


VIWQQKFYSN 


400 


TNSIYGPANT


WNAMPDRGGV 


TANHYDHVHV 


SFNK (SEQ ID NO:39)


434 






FIG. 7c 






MPHLSKEAFK 


KQIKNGIIVS 


CQALPGEPLY 


TESGGVMPLL 


ALAAQEAGAV 


50 


GIRANSVRDI 


KEIQEVTNLP 


IIGIIKREYP 


PQEPFITATM 


TEVDQLASLD 


100 


IAVIALDCTL 


RERHDGLSVA 


EFIQKIKGKY 


PEQLLMADIS 


TFEEGKNAFE 


150 


AGVDFVGTTL 


SGYTDYXRQE 


EGPDIELLNK 


LCQAGIDVIA 


EGKIHTPKQA 


200 


NEINHTGVAG 





KEIAERFISG 


LS (SEQ ID NO:40)


232 
MSIKKSVIGF


CLGAAALSMF 


ACVDSSQSVM 


AAEKDKVEI 




39 


(SEQ ID NO:41)
ATGAAAATGA ATAAAAAGGT ACTATTGACA TCGACAATGG CAGCTTCGCT 50

ATTATCAGTC GCAAGTGTTC AAGCACAAGA AACAGATACG ACGTGGACAG 100 

CACGTACTGT TTCAGAGGTA AAGGCTGATT TGGTAAAGCA AGACAATAAA 150 

TCATCATATA CTGTGAAATA TGGTGATACA CTAAGCGTTA TTTCAGAAGC 200

AATGTCAATT GATATGAATG TCTTAGCAAA AATTAATAAC ATTGCAGATA 250 

TCAATCTTAT TTATCCTGAG ACAACACTGA CAGTAACTTA CGATCAGAAG 300

AGTCATACTG CCACTTCAAT GAAAATAGAA ACACCAGCAA CAAATGCTGC 350 

TGGTCAAACA ACAGCTACTG TGGATTTGAA AACCAATCAA GTTTCTGTTG 400

CAGACCAAAA AGTTTCTCTC AATACAATTT CGGAAGGTAT GACACCAGAA 450 

GCAGCAACAA CGATTGTTTC GCCAATGAAG ACATATTCTT CTGCGCCAGC 500 

TTTGAAATCA AAAGAAGTAT TAGCACAAGA GCAAGCTGTT AGTCAAGCAG 550 

CAGCTAATGA ACAGGTATCA ACAGCTCCTG TGAAGTCGAT TACTTCAGAA 600 

GTTCCAGCAG CTAAAGAGGA AGTTAAACCA ACTCAGACGT CAGTCAGTCA 650 

GTCAACAACA GTATCACCAG CTTCTGTTGC CGCTGAAACA CCAGCTCCAG 700 

TAGCTAAAGT AGCACCGGTA AGAACTGTAG CAGCCCCTAG AGTGGCAAGT 750 

GTTAAAGTAG TCACTCCTAA AGTAGAAACT GGTGCATCAC CAGAGCATGT 800 

ATCAGCTCCA GCAGTTCCTG TGACTACGAC TTCAACAGCT ACAGACAGTA 850 

AGTTACAAGC GACTGAAGTT AAGAGCGTTC CGGTAGCACA AAAAGCTCCA 900 

ACAGCAACAC CGGTAGCACA ACCAGCTTCA ACAACAAATG CAGTAGCTGC 950 

ACATCCTGAA AATGCAGGGC TCCAACCTCA TGTTGCAGCT TATAAAGAAA 1000 

AAGTAGCGTC AACTTATGGA GTTAATGAAT TCAGTACATA CCGTGCAGGT 1050 

GATCCAGGTG ATCATGGTAA AGGTTTAGCA GTCGACTTTA TTGTAGGTAA 1100 

AAACCAAGCA CTTGGTAATG AAGTTGCACA GTACTCTACA CAAAATATGG 1150 

CAGCAAATAA CATTTCATAT GTTATCTGGC AACAAAAGTT TTACTCAAAT 1200 

ACAAATAGTA TTTATGGACC TGCTAATACT TGGAATGCAA TGCCAGATCG 1250 

TGGTGGCGTT ACTGCCAACC ATTATGACCA TGTTCACGTA TCATTTAACA 1300 

AATAA 1305

(SEQ ID NO:42) 

FIG. 8 
CAAGAAACAG ATACGACGTG GACAGCACGT ACTGTTTCAG AGGTAAAGGC 50

TGATTTGGTA AAGCAAGACA ATAAATCATC ATATACTGTG AAATATGGTG 100

ATACACTAAG CGTTATTTCA GAAGCAATGT CAATTGATAT GAATGTCTTA 150

GCAAAAATTA ATAACATTGC AGATATCAAT CTTATTTATC CTGAGACAAC 200

ACTGACAGTA ACTTACGATC AGAAGAGTCA TACTGCCACT TCAATGAAAA 250

TAGAAACACC AGCAACAAAT GCTGCTGGTC AAACAACAGC TACTGTGGAT 300

TTGAAAACCA ATCAAGTTTC TGTTGCAGAC CAAAAAGTTT CTCTCAATAC 350

AATTTCGGAA GGTATGACAC CAGAAGCAGC AACAACGATT GTTTCGCCAA 400

TGAAGACATA TTCTTCTGCG CCAGCTTTGA AATCAAAAGA AGTATTAGCA 450

CAAGAGCAAG CTGTTAGTCA AGCAGCAGCT AATGAACAGG TATCAACAGC 500

TCCTGTGAAG TCGATTACTT CAGAAGTTCC AGCAGCTAAA GAGGAAGTTA 550

AACCAACTCA GACGTCAGTC AGTCAGTCAA CAACAGTATC ACCAGCTTCT 600

GTTGCCGCTG AAACACCAGC TCCAGTAGCT AAAGTAGCAC CGGTAAGAAC 650

TGTAGCAGCC CCTAGAGTGG CAAGTGTTAA AGTAGTCACT CCTAAAGTAG 700

AAACTGGTGC ATCACCAGAG CATGTATCAG CTCCAGCAGT TCCTGTGACT 750

ACGACTTCAA CAGCTACAGA CAGTAAGTTA CAAGCGACTG AAGTTAAGAG 800

CGTTCCGGTA GCACAAAAAG CTCCAACAGC AACACCGGTA GCACAACCAG 850

CTTCAACAAC AAATGCAGTA GCTGCACATC CTGAAAATGC AGGGCTCCAA 900

CCTCATGTTG CAGCTTATAA AGAAAAAGTA GCGTCAACTT ATGGAGTTAA 950

TGAATTCAGT ACATACCGTG CAGGTGATCC AGGTGATCAT GGTAAAGGTT 1000

TAGCAGTCGA CTTTATTGTA GGTAAAAACC AAGCACTTGG TAATGAAGTT 1050

GCACAGTACT CTACACAAAA TATGGCAGCA AATAACATTT CATATGTTAT 1100

CTGGCAACAA AAGTTTTACT CAAATACAAA TAGTATTTAT GGACCTGCTA 1150

ATACTTGGAA TGCAATGCCA GATCGTGGTG GCGTTACTGC CAACCATTAT 1200

GACCATGTTC ACGTATCATT TAACAAATAA (SEQ ID NO:43) 1230



FIG. 9



QETDTTWTAR TVSEVKADLV KQDNKSSYTV KYGDTLSVIS EAMSIDMNVL 50

AKINNIADIN LIYPETTLTV TYDQKSHTAT SMKIETPATN AAGQTTATVD 100

LKTNQVSVAD QKVSLNTISE GMTPEAATTI VSPMKTYSSA PALKSKEVLA 150

QEQAVSQAAA NEQVSTAPVK SITSEVPAAK EEVKPTQTSV SQSTTVSPAS 200

VAAETPAPVA KVAPVRTVAA PRVASVKVVT PKVETGASPE HVSAPAVPVT 250

TTSTATDSKL QATEVKSVPV AQKAPTATPV AQPASTTNAV AAHPENAGLQ 300

PHVAAYKEKV ASTYGVNEFS TYRAGDPGDH GKGLAVDFIV GKNQALGNEV 350

AQYSTQNMAA NNISYVIWQQ KFYSNTNSIY GPANTWNAMP DRGGVTANHY 400

DHVHVSFNK (SEQ ID NO:44) 409



FIG. 9a 






Fig. 10

[Figure: three panels titled "Experience MB-14, NCS 954 (type III)", "Experience MB-20, NCS 535 (type V)" and "Experience MB-16, ATCC12401 (type Ib)"; vertical axis in powers of ten; horizontal axis: post-1st, post-2nd, post-3rd.]
SEQUENCE LISTING

<110> BioChem Vaccins 
RIOUX, Clement 
DENIS, Martin 
BRODEUR, Bernard R.
HAMEL, Josee 
CHARLEBOIS, Isabelle
BOYER, Martine 



<120> NOVEL GROUP B STREPTOCOCCUS ANTIGENS



<130> 12806-9PCT 

<150> 60/075,425 
<151> 1998-02-20 

<160> 44 

<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 4514 
<212> DNA 

<213> Streptococcus 

<220> 

<221> CDS
<222> (3)...(464)
<221> CDS
<222> (534)...(887)
<223>
<221> CDS
<222> (1024)...(1767)
<221> CDS
<222> (1841)...(4288)
<221> CDS
<222> (2735)...(4288)
<400> 1 

ta tct ggc aaa gag cca gct aat cgt ttt agt tgg gct aaa aat aaa 47
Ser Gly Lys Glu Pro Ala Asn Arg Phe Ser Trp Ala Lys Asn Lys
1 5 10 15
tta tta atc aat gga ttc att gca act cta gca gca act atc tta ttt 95

Leu Leu Ile Asn Gly Phe Ile Ala Thr Leu Ala Ala Thr Ile Leu Phe
20 25 30

ttt gca gtt caa ttc ata ggt ctt aaa cca gat tac cct gga aaa acc 143

Phe Ala Val Gln Phe Ile Gly Leu Lys Pro Asp Tyr Pro Gly Lys Thr
35 40 45

tac ttt att atc cta ttg aca gca tgg act ttg atg gca tta gta act 191

Tyr Phe Ile Ile Leu Leu Thr Ala Trp Thr Leu Met Ala Leu Val Thr
50 55 60

gct tta gtg gga tgg gat aat agg tat ggt tcc ttc ttg tcg tta tta 239

Ala Leu Val Gly Trp Asp Asn Arg Tyr Gly Ser Phe Leu Ser Leu Leu
65 70 75

ata tta tta ttc cag ctt ggt tca agc gca gga act tac cca ata gaa 287

Ile Leu Leu Phe Gln Leu Gly Ser Ser Ala Gly Thr Tyr Pro Ile Glu

80 85 90 95

ttg agt cct aag ttc ttt caa aca att caa cca ttt tta ccg atg act 335

Leu Ser Pro Lys Phe Phe Gln Thr Ile Gln Pro Phe Leu Pro Met Thr
100 105 110

tac tct gtt tca gga tta aga gag acc atc tcg ttg acg gga gac gtt 383

Tyr Ser Val Ser Gly Leu Arg Glu Thr Ile Ser Leu Thr Gly Asp Val
115 120 125

aac cat caa tgg aga atg cta gta atc ttt tta gta tca tcg atg ata 431

Asn His Gln Trp Arg Met Leu Val Ile Phe Leu Val Ser Ser Met Ile
130 135 140

ctt gct ctt ctt att tat cgt aaa caa gaa gat taatagaaag tatctagtga 484
Leu Ala Leu Leu Ile Tyr Arg Lys Gln Glu Asp
145 150

tagactaaca gtatgatatg gtatgtcaaa gtatttagga ggagaagat atg tct act 542

Met Ser Thr
155

tta aca ata att att gca aca tta act gct ttg gaa cat ttt tat att 590

Leu Thr Ile Ile Ile Ala Thr Leu Thr Ala Leu Glu His Phe Tyr Ile
160 165 170

atg tat ttg gag acg tta gcc acc cag tca aat atg act ggg aag att 638

Met Tyr Leu Glu Thr Leu Ala Thr Gln Ser Asn Met Thr Gly Lys Ile
175 180 185

ttt agt atg tct aaa gaa gag ttg tca tat tta ccc gtt att aaa ctt 686

Phe Ser Met Ser Lys Glu Glu Leu Ser Tyr Leu Pro Val Ile Lys Leu

190 195 200 205

ttt aag aat caa ggt gta tac aac ggc ttg att ggc cta ttc ctc ctt 734

Phe Lys Asn Gln Gly Val Tyr Asn Gly Leu Ile Gly Leu Phe Leu Leu
210 215 220 
tat ggg tta tat att tca cag aat caa gaa att gta gct gtt ttt tta 782

Tyr Gly Leu Tyr Ile Ser Gln Asn Gln Glu Ile Val Ala Val Phe Leu
225 230 235

atc aat gta ttg cta gtt gct att tat ggt gct ttg aca gtt gat aaa 830

Ile Asn Val Leu Leu Val Ala Ile Tyr Gly Ala Leu Thr Val Asp Lys
240 245 250

aaa atc tta tta aaa cag ggt ggt tta cct ata tta gct ctt tta aca 878

Lys Ile Leu Leu Lys Gln Gly Gly Leu Pro Ile Leu Ala Leu Leu Thr
255 260 265

ttc tta ttt taatactact tagccgttcg atttagttga acggctttta 927

Phe Leu Phe

270

gtaatcattt ttttctcata atacaggtag tttaagtaat ttgtctttaa aaatagtata 987

atataactac gaattcaaag agaggtgact ttgatt atg act gag aac tgg tta 1041

Met Thr Glu Asn Trp Leu
275

cat act aaa gat ggt tca gat att tat tat cgt gtc gtt ggt caa ggt 1089

His Thr Lys Asp Gly Ser Asp Ile Tyr Tyr Arg Val Val Gly Gln Gly
280 285 290

caa ccg att gtt ttt tta cat ggc aat agc tta agt agt cgc tat ttt 1137

Gln Pro Ile Val Phe Leu His Gly Asn Ser Leu Ser Ser Arg Tyr Phe
295 300 305 310

gat aag caa ata gca tat ttt tct aag tat tac caa gtt att gtt atg 1185

Asp Lys Gln Ile Ala Tyr Phe Ser Lys Tyr Tyr Gln Val Ile Val Met

315 320 325

gat agt aga ggg cat ggc aaa agt cat gca aag cta aat acc att agt 1233

Asp Ser Arg Gly His Gly Lys Ser His Ala Lys Leu Asn Thr Ile Ser
330 335 340

ttc agg caa ata gca gtt gac tta aag gat atc tta gtt cat tta gag 1281

Phe Arg Gln Ile Ala Val Asp Leu Lys Asp Ile Leu Val His Leu Glu
345 350 355

att gat aaa gtt ata ttg gta ggc cat agc gat ggt gcc aat tta gct 1329

Ile Asp Lys Val Ile Leu Val Gly His Ser Asp Gly Ala Asn Leu Ala
360 365 370

tta gtt ttt caa acg atg ttt cca ggt atg gtt aga ggg ctt ttg ctt 1377

Leu Val Phe Gln Thr Met Phe Pro Gly Met Val Arg Gly Leu Leu Leu
375 380 385 390

aat tca ggg aac ctg act att cat ggt cag cga tgg tgg gat att ctt 1425

Asn Ser Gly Asn Leu Thr Ile His Gly Gln Arg Trp Trp Asp Ile Leu

395 400 405

tta gta agg att gcc tat aaa ttc ctt cac tat tta ggg aaa ctc ttt 1473

Leu Val Arg Ile Ala Tyr Lys Phe Leu His Tyr Leu Gly Lys Leu Phe
410 415 420

ccg tat atg agg caa aaa gct caa gtt att tcg ctt atg ttg gag gat 1521
Pro Tyr Met Arg Gln Lys Ala Gln Val Ile Ser Leu Met Leu Glu Asp
425 430 435

ttg aag att agt cca gct gat tta cag cat gtg tca act cct gta atg 1569
Leu Lys Ile Ser Pro Ala Asp Leu Gln His Val Ser Thr Pro Val Met
440 445 450

gtt ttg gtt gga aat aag gac ata att aag tta aat cat tct aag aaa 1617
Val Leu Val Gly Asn Lys Asp Ile Ile Lys Leu Asn His Ser Lys Lys
455 460 465 470

ctt gct tct tat ttt cca agg ggg gag ttt tat tct tta gtt ggc ttt 1665
Leu Ala Ser Tyr Phe Pro Arg Gly Glu Phe Tyr Ser Leu Val Gly Phe
475 480 485

ggg cat cac att att aag caa gat tcc cat gtt ttt aat att att gca 1713
Gly His His Ile Ile Lys Gln Asp Ser His Val Phe Asn Ile Ile Ala
490 495 500

aaa aag ttt atc aac gat acg ttg aaa gga gaa att gtt gaa aaa gct 1761
Lys Lys Phe Ile Asn Asp Thr Leu Lys Gly Glu Ile Val Glu Lys Ala
505 510 515

aat tga aaaagtcaaa tcactgactt ctgtgattaa aattgtattt tttatatctg 1817
Asn *

ttttagtgct tattattgtt gaa atg att cat ttg aaa cga act att tct gtt 1870
Met Ile His Leu Lys Arg Thr Ile Ser Val
520 525

gag caa cta aag agt gtt ttt ggg caa tta tct cca atg aat ctt ttc 1918
Glu Gln Leu Lys Ser Val Phe Gly Gln Leu Ser Pro Met Asn Leu Phe
530 535 540 545

tta att atc ctt gtg ggg gtt atc gct gtc tta ccg aca acc gga tat 1966
Leu Ile Ile Leu Val Gly Val Ile Ala Val Leu Pro Thr Thr Gly Tyr
550 555 560

gac ttt gta ctg aat gga ctt tta cgt aca gat aaa agc aaa agg tat 2014
Asp Phe Val Leu Asn Gly Leu Leu Arg Thr Asp Lys Ser Lys Arg Tyr
565 570 575

att tta cag act agt tgg tgt atc aac act ttt aat aac ttg tca gga 2062
Ile Leu Gln Thr Ser Trp Cys Ile Asn Thr Phe Asn Asn Leu Ser Gly
580 585 590

ttc ggt ggc tta atc gat att ggg ttg cgc atg gct ttt tat ggt aaa 2110
Phe Gly Gly Leu Ile Asp Ile Gly Leu Arg Met Ala Phe Tyr Gly Lys
595 600 605

aaa ggt caa gag aag agt gac cta aga gaa gtg act cgt ttt tta ccc 2158
Lys Gly Gln Glu Lys Ser Asp Leu Arg Glu Val Thr Arg Phe Leu Pro
610 615 620 625

tat ctt att tct ggt ctg tca ttt att agt gtg att gcc tta atc atg 2206
Tyr Leu Ile Ser Gly Leu Ser Phe Ile Ser Val Ile Ala Leu Ile Met
630 635 640

agc cat att ttt cat gcc aaa gct agt gtt gat tac tat tat ttg gta 2254
Ser His Ile Phe His Ala Lys Ala Ser Val Asp Tyr Tyr Tyr Leu Val
645 650 655

tta att ggt gct agt atg tat ttt cct gtt att tat tgg att tct ggt 2302
Leu Ile Gly Ala Ser Met Tyr Phe Pro Val Ile Tyr Trp Ile Ser Gly
660 665 670

cat aaa gga agc cat tat ttc gga gat atg cca tct agt act cgt ata 2350
His Lys Gly Ser His Tyr Phe Gly Asp Met Pro Ser Ser Thr Arg Ile
675 680 685

aaa tta ggt gtt gtt tct ttt ttt gaa tgg gga tgt gcg gcc gca gca 2398
Lys Leu Gly Val Val Ser Phe Phe Glu Trp Gly Cys Ala Ala Ala Ala
690 695 700 705

ttt ata att atc ggt tat tta atg ggc att cat cta cca gtt tat aaa 2446
Phe Ile Ile Ile Gly Tyr Leu Met Gly Ile His Leu Pro Val Tyr Lys
710 715 720

att tta cca cta ttt tgt att ggt tgt gcc gtc ggg att gta tcc ctt 2494
Ile Leu Pro Leu Phe Cys Ile Gly Cys Ala Val Gly Ile Val Ser Leu
725 730 735

att ccc ggt gga tta gga agt ttt gaa tta gtt cta ttt aca ggg ttt 2542
Ile Pro Gly Gly Leu Gly Ser Phe Glu Leu Val Leu Phe Thr Gly Phe
740 745 750

gct gcc gag gga cta cct aaa gaa act gtg gtt gca tgg tta tta ctt 2590
Ala Ala Glu Gly Leu Pro Lys Glu Thr Val Val Ala Trp Leu Leu Leu
755 760 765

tat cgt tta gcc tac tat att att cca ttc ttt gca ggt atc tat ttc 2638
Tyr Arg Leu Ala Tyr Tyr Ile Ile Pro Phe Phe Ala Gly Ile Tyr Phe
770 775 780 785

ttt atc cat tat tta ggt agt caa ata aat caa cgt tat gaa aat gtc 2686
Phe Ile His Tyr Leu Gly Ser Gln Ile Asn Gln Arg Tyr Glu Asn Val
790 795 800

ccg aaa gag tta gta tca act gtt cta caa acc atg gtg agc cat ttg 2734
Pro Lys Glu Leu Val Ser Thr Val Leu Gln Thr Met Val Ser His Leu
805 810 815

atg cgt att tta ggt gca ttc tta ata ttt tca aca gca ttt ttt gaa 2782
Met Arg Ile Leu Gly Ala Phe Leu Ile Phe Ser Thr Ala Phe Phe Glu
820 825 830

aat att act tat att atg tgg ttg cag aag cta ggc ttg gac cca tta 2830
Asn Ile Thr Tyr Ile Met Trp Leu Gln Lys Leu Gly Leu Asp Pro Leu
835 840 845

caa gaa caa atg cta tgg cag ttt cca ggt tta ttg ctg ggg gtt tgt 2878
Gln Glu Gln Met Leu Trp Gln Phe Pro Gly Leu Leu Leu Gly Val Cys
850 855 860 865

ttt att ctc tta gct aga act att gat caa aaa gtg aaa aat gct ttt 2926
Phe Ile Leu Leu Ala Arg Thr Ile Asp Gln Lys Val Lys Asn Ala Phe
870 875 880

cca att gct att atc tgg att act ttg aca ttg ttt tat ctt aat tta 2974
Pro Ile Ala Ile Ile Trp Ile Thr Leu Thr Leu Phe Tyr Leu Asn Leu
885 890 895

ggt cat att agt tgg cga cta tct ttc tgg ttt att tta cta ttg tta 3022
Gly His Ile Ser Trp Arg Leu Ser Phe Trp Phe Ile Leu Leu Leu Leu
900 905 910

ggc tta tta gtc att aag cca act ctc tat aaa aaa caa ttt att tat 3070
Gly Leu Leu Val Ile Lys Pro Thr Leu Tyr Lys Lys Gln Phe Ile Tyr
915 920 925

agc tgg gaa gag cgt att aag gat gga atc att atc gtt agt tta atg 3118
Ser Trp Glu Glu Arg Ile Lys Asp Gly Ile Ile Ile Val Ser Leu Met
930 935 940 945

gga gtt cta ttt tat att gca gga cta cta ttc cct atc agg gct cat 3166
Gly Val Leu Phe Tyr Ile Ala Gly Leu Leu Phe Pro Ile Arg Ala His
950 955 960

att aca ggt ggt agt att gaa cgc ctg cat tat atc ata gca tgg gag 3214
Ile Thr Gly Gly Ser Ile Glu Arg Leu His Tyr Ile Ile Ala Trp Glu
965 970 975

ccg ata gca ttg gct acg ttg att ctt act ctc gtt tat tta tgt ttg 3262
Pro Ile Ala Leu Ala Thr Leu Ile Leu Thr Leu Val Tyr Leu Cys Leu
980 985 990

gtt aag att tta caa gga aaa tct tgt cag att ggt gat gtg ttc aat 3310
Val Lys Ile Leu Gln Gly Lys Ser Cys Gln Ile Gly Asp Val Phe Asn
995 1000 1005

gtg gat cgt tat aaa aaa cta ctt caa gct tac ggt ggt tct tcg gat 3358
Val Asp Arg Tyr Lys Lys Leu Leu Gln Ala Tyr Gly Gly Ser Ser Asp
1010 1015 1020 1025

agc ggt tta gcc ttt tta aat gat aaa agg ctc tac tgg tac caa aaa 3406
Ser Gly Leu Ala Phe Leu Asn Asp Lys Arg Leu Tyr Trp Tyr Gln Lys
1030 1035 1040

aat gga gaa gat tgc gtt gcg ttc caa ttt gta att gtc aat aat aaa 3454
Asn Gly Glu Asp Cys Val Ala Phe Gln Phe Val Ile Val Asn Asn Lys
1045 1050 1055

tgt ctt att atg ggg gaa cca gcc ggt gat gac act tat att cgt gaa 3502
Cys Leu Ile Met Gly Glu Pro Ala Gly Asp Asp Thr Tyr Ile Arg Glu
1060 1065 1070

gct att gaa tcg ttt att gat gat gct gat aag cta gac tat gac ctt 3550
Ala Ile Glu Ser Phe Ile Asp Asp Ala Asp Lys Leu Asp Tyr Asp Leu
1075 1080 1085

gtt ttt tac agt att gga cag aag ttg aca cta ctt tta cat gag tat 3598
Val Phe Tyr Ser Ile Gly Gln Lys Leu Thr Leu Leu Leu His Glu Tyr
1090 1095 1100 1105

ggt ttt gac ttt atg aaa gtt ggt gag gat gct tta gtt aat tta gaa 3646
Gly Phe Asp Phe Met Lys Val Gly Glu Asp Ala Leu Val Asn Leu Glu
1110 1115 1120

acg ttt act ctt aaa ggg aat aag tac aaa cct ttc aga aat gcc cta 3694
Thr Phe Thr Leu Lys Gly Asn Lys Tyr Lys Pro Phe Arg Asn Ala Leu
1125 1130 1135

aat aga gtt gaa aag gat ggt ttc tat ttc gaa gtt gta caa tcg cca 3742
Asn Arg Val Glu Lys Asp Gly Phe Tyr Phe Glu Val Val Gln Ser Pro
1140 1145 1150

cat agt caa gag cta cta aat agt ttg gaa gag att tct aat act tgg 3790
His Ser Gln Glu Leu Leu Asn Ser Leu Glu Glu Ile Ser Asn Thr Trp
1155 1160 1165

tta gaa gga cgt cct gaa aaa ggt ttc tca cta gga tat ttt aat aaa 3838
Leu Glu Gly Arg Pro Glu Lys Gly Phe Ser Leu Gly Tyr Phe Asn Lys
1170 1175 1180 1185

gat tat ttc caa caa gcc cca ata gct ttg gta aaa aat gct gaa cac 3886
Asp Tyr Phe Gln Gln Ala Pro Ile Ala Leu Val Lys Asn Ala Glu His
1190 1195 1200

gaa gtt gtt gct ttt gct aat att atg cca aac tat gaa aag agt att 3934
Glu Val Val Ala Phe Ala Asn Ile Met Pro Asn Tyr Glu Lys Ser Ile
1205 1210 1215

atc tct att gat tta atg cgt cac gat aaa cag aaa att ccg aat ggc 3982
Ile Ser Ile Asp Leu Met Arg His Asp Lys Gln Lys Ile Pro Asn Gly
1220 1225 1230

gtt atg gat ttc ctc ttt tta tca tta ttc tct tat tat caa gag aag 4030
Val Met Asp Phe Leu Phe Leu Ser Leu Phe Ser Tyr Tyr Gln Glu Lys
1235 1240 1245

gga tac cac tat ttt gat ttg ggg atg gca cct tta tca gga gtt ggt 4078
Gly Tyr His Tyr Phe Asp Leu Gly Met Ala Pro Leu Ser Gly Val Gly
1250 1255 1260 1265

cgc gtt gaa aca agt ttt gct aaa gag aga atg gcg tat ctt gtc tat 4126
Arg Val Glu Thr Ser Phe Ala Lys Glu Arg Met Ala Tyr Leu Val Tyr
1270 1275 1280

cat ttc ggt agt cat ttc tac tca ttt aat ggt tta cac aag tat aag 4174
His Phe Gly Ser His Phe Tyr Ser Phe Asn Gly Leu His Lys Tyr Lys
1285 1290 1295

aag aag ttt aca cca ttg tgg tcg gaa cgt tat att tct tgt tct cgt 4222
Lys Lys Phe Thr Pro Leu Trp Ser Glu Arg Tyr Ile Ser Cys Ser Arg
1300 1305 1310

tcg tcc tgg tta att tgt gct att tgt gcc cta tta atg gaa gat agt 4270
Ser Ser Trp Leu Ile Cys Ala Ile Cys Ala Leu Leu Met Glu Asp Ser
1315 1320 1325

aaa att aag att gtt aaa taagctttat ttggcaatta aaaagagcat 4318
Lys Ile Lys Ile Val Lys
1330 1335

gtcatgcgac atgctctttt taaatcattt aataccattg attgcttgaa tctactttat 4378

aatatgatgt gcttttaaat attgtttagc tactgtagct gctgatttat gctttacagc 4438 

tacttggtag ttcatttctt gcatttcttt ttcagtgata tgaccagcaa gtttattgag 4498 

agcttttttt acttga 4514 

<210> 2 
<211> 154 
<212> PRT 

<213> Streptococcus 
<400> 2 

Ser Gly Lys Glu Pro Ala Asn Arg Phe Ser Trp Ala Lys Asn Lys Leu
1 5 10 15

Leu Ile Asn Gly Phe Ile Ala Thr Leu Ala Ala Thr Ile Leu Phe Phe
20 25 30

Ala Val Gln Phe Ile Gly Leu Lys Pro Asp Tyr Pro Gly Lys Thr Tyr
35 40 45

Phe Ile Ile Leu Leu Thr Ala Trp Thr Leu Met Ala Leu Val Thr Ala
50 55 60

Leu Val Gly Trp Asp Asn Arg Tyr Gly Ser Phe Leu Ser Leu Leu Ile
65 70 75 80

Leu Leu Phe Gln Leu Gly Ser Ser Ala Gly Thr Tyr Pro Ile Glu Leu
85 90 95

Ser Pro Lys Phe Phe Gln Thr Ile Gln Pro Phe Leu Pro Met Thr Tyr
100 105 110

Ser Val Ser Gly Leu Arg Glu Thr Ile Ser Leu Thr Gly Asp Val Asn
115 120 125

His Gln Trp Arg Met Leu Val Ile Phe Leu Val Ser Ser Met Ile Leu
130 135 140

Ala Leu Leu Ile Tyr Arg Lys Gln Glu Asp
145 150

<210> 3 
<211> 118 
<212> PRT 

<213> Streptococcus 
<400> 3 

Met Ser Thr Leu Thr Ile Ile Ile Ala Thr Leu Thr Ala Leu Glu His
1 5 10 15

Phe Tyr Ile Met Tyr Leu Glu Thr Leu Ala Thr Gln Ser Asn Met Thr
20 25 30

Gly Lys Ile Phe Ser Met Ser Lys Glu Glu Leu Ser Tyr Leu Pro Val
35 40 45

Ile Lys Leu Phe Lys Asn Gln Gly Val Tyr Asn Gly Leu Ile Gly Leu
50 55 60

Phe Leu Leu Tyr Gly Leu Tyr Ile Ser Gln Asn Gln Glu Ile Val Ala
65 70 75 80

Val Phe Leu Ile Asn Val Leu Leu Val Ala Ile Tyr Gly Ala Leu Thr
85 90 95

Val Asp Lys Lys Ile Leu Leu Lys Gln Gly Gly Leu Pro Ile Leu Ala
100 105 110

Leu Leu Thr Phe Leu Phe
115

<210> 4
<211> 247
<212> PRT
<213> Streptococcus
<400> 4

Met Thr Glu Asn Trp Leu His Thr Lys Asp Gly Ser Asp Ile Tyr Tyr
1 5 10 15

Arg Val Val Gly Gln Gly Gln Pro Ile Val Phe Leu His Gly Asn Ser
20 25 30

Leu Ser Ser Arg Tyr Phe Asp Lys Gln Ile Ala Tyr Phe Ser Lys Tyr
35 40 45

Tyr Gln Val Ile Val Met Asp Ser Arg Gly His Gly Lys Ser His Ala
50 55 60

Lys Leu Asn Thr Ile Ser Phe Arg Gln Ile Ala Val Asp Leu Lys Asp
65 70 75 80

Ile Leu Val His Leu Glu Ile Asp Lys Val Ile Leu Val Gly His Ser
85 90 95

Asp Gly Ala Asn Leu Ala Leu Val Phe Gln Thr Met Phe Pro Gly Met
100 105 110

Val Arg Gly Leu Leu Leu Asn Ser Gly Asn Leu Thr Ile His Gly Gln
115 120 125

Arg Trp Trp Asp Ile Leu Leu Val Arg Ile Ala Tyr Lys Phe Leu His
130 135 140

Tyr Leu Gly Lys Leu Phe Pro Tyr Met Arg Gln Lys Ala Gln Val Ile
145 150 155 160

Ser Leu Met Leu Glu Asp Leu Lys Ile Ser Pro Ala Asp Leu Gln His
165 170 175

Val Ser Thr Pro Val Met Val Leu Val Gly Asn Lys Asp Ile Ile Lys
180 185 190

Leu Asn His Ser Lys Lys Leu Ala Ser Tyr Phe Pro Arg Gly Glu Phe
195 200 205

Tyr Ser Leu Val Gly Phe Gly His His Ile Ile Lys Gln Asp Ser His
210 215 220

Val Phe Asn Ile Ile Ala Lys Lys Phe Ile Asn Asp Thr Leu Lys Gly
225 230 235 240

Glu Ile Val Glu Lys Ala Asn
245

<210> 5
<211> 816
<212> PRT
<213> Streptococcus
<400> 5

Met Ile His Leu Lys Arg Thr Ile Ser Val Glu Gln Leu Lys Ser Val
1 5 10 15

Phe Gly Gln Leu Ser Pro Met Asn Leu Phe Leu Ile Ile Leu Val Gly
20 25 30

Val Ile Ala Val Leu Pro Thr Thr Gly Tyr Asp Phe Val Leu Asn Gly
35 40 45

Leu Leu Arg Thr Asp Lys Ser Lys Arg Tyr Ile Leu Gln Thr Ser Trp
50 55 60

Cys Ile Asn Thr Phe Asn Asn Leu Ser Gly Phe Gly Gly Leu Ile Asp
65 70 75 80

Ile Gly Leu Arg Met Ala Phe Tyr Gly Lys Lys Gly Gln Glu Lys Ser
85 90 95

Asp Leu Arg Glu Val Thr Arg Phe Leu Pro Tyr Leu Ile Ser Gly Leu
100 105 110

Ser Phe Ile Ser Val Ile Ala Leu Ile Met Ser His Ile Phe His Ala
115 120 125

Lys Ala Ser Val Asp Tyr Tyr Tyr Leu Val Leu Ile Gly Ala Ser Met
130 135 140

Tyr Phe Pro Val Ile Tyr Trp Ile Ser Gly His Lys Gly Ser His Tyr
145 150 155 160

Phe Gly Asp Met Pro Ser Ser Thr Arg Ile Lys Leu Gly Val Val Ser
165 170 175

Phe Phe Glu Trp Gly Cys Ala Ala Ala Ala Phe Ile Ile Ile Gly Tyr
180 185 190

Leu Met Gly Ile His Leu Pro Val Tyr Lys Ile Leu Pro Leu Phe Cys
195 200 205

Ile Gly Cys Ala Val Gly Ile Val Ser Leu Ile Pro Gly Gly Leu Gly
210 215 220

Ser Phe Glu Leu Val Leu Phe Thr Gly Phe Ala Ala Glu Gly Leu Pro
225 230 235 240

Lys Glu Thr Val Val Ala Trp Leu Leu Leu Tyr Arg Leu Ala Tyr Tyr
245 250 255

Ile Ile Pro Phe Phe Ala Gly Ile Tyr Phe Phe Ile His Tyr Leu Gly
260 265 270

Ser Gln Ile Asn Gln Arg Tyr Glu Asn Val Pro Lys Glu Leu Val Ser
275 280 285

Thr Val Leu Gln Thr Met Val Ser His Leu Met Arg Ile Leu Gly Ala
290 295 300

Phe Leu Ile Phe Ser Thr Ala Phe Phe Glu Asn Ile Thr Tyr Ile Met
305 310 315 320

Trp Leu Gln Lys Leu Gly Leu Asp Pro Leu Gln Glu Gln Met Leu Trp
325 330 335

Gln Phe Pro Gly Leu Leu Leu Gly Val Cys Phe Ile Leu Leu Ala Arg
340 345 350

Thr Ile Asp Gln Lys Val Lys Asn Ala Phe Pro Ile Ala Ile Ile Trp
355 360 365

Ile Thr Leu Thr Leu Phe Tyr Leu Asn Leu Gly His Ile Ser Trp Arg
370 375 380

Leu Ser Phe Trp Phe Ile Leu Leu Leu Leu Gly Leu Leu Val Ile Lys
385 390 395 400

Pro Thr Leu Tyr Lys Lys Gln Phe Ile Tyr Ser Trp Glu Glu Arg Ile
405 410 415

Lys Asp Gly Ile Ile Ile Val Ser Leu Met Gly Val Leu Phe Tyr Ile
420 425 430

Ala Gly Leu Leu Phe Pro Ile Arg Ala His Ile Thr Gly Gly Ser Ile
435 440 445

Glu Arg Leu His Tyr Ile Ile Ala Trp Glu Pro Ile Ala Leu Ala Thr
450 455 460

Leu Ile Leu Thr Leu Val Tyr Leu Cys Leu Val Lys Ile Leu Gln Gly
465 470 475 480

Lys Ser Cys Gln Ile Gly Asp Val Phe Asn Val Asp Arg Tyr Lys Lys
485 490 495

Leu Leu Gln Ala Tyr Gly Gly Ser Ser Asp Ser Gly Leu Ala Phe Leu
500 505 510

Asn Asp Lys Arg Leu Tyr Trp Tyr Gln Lys Asn Gly Glu Asp Cys Val
515 520 525

Ala Phe Gln Phe Val Ile Val Asn Asn Lys Cys Leu Ile Met Gly Glu
530 535 540

Pro Ala Gly Asp Asp Thr Tyr Ile Arg Glu Ala Ile Glu Ser Phe Ile
545 550 555 560

Asp Asp Ala Asp Lys Leu Asp Tyr Asp Leu Val Phe Tyr Ser Ile Gly
565 570 575

Gln Lys Leu Thr Leu Leu Leu His Glu Tyr Gly Phe Asp Phe Met Lys
580 585 590

Val Gly Glu Asp Ala Leu Val Asn Leu Glu Thr Phe Thr Leu Lys Gly
595 600 605

Asn Lys Tyr Lys Pro Phe Arg Asn Ala Leu Asn Arg Val Glu Lys Asp
610 615 620

Gly Phe Tyr Phe Glu Val Val Gln Ser Pro His Ser Gln Glu Leu Leu
625 630 635 640

Asn Ser Leu Glu Glu Ile Ser Asn Thr Trp Leu Glu Gly Arg Pro Glu
645 650 655

Lys Gly Phe Ser Leu Gly Tyr Phe Asn Lys Asp Tyr Phe Gln Gln Ala
660 665 670

Pro Ile Ala Leu Val Lys Asn Ala Glu His Glu Val Val Ala Phe Ala
675 680 685

Asn Ile Met Pro Asn Tyr Glu Lys Ser Ile Ile Ser Ile Asp Leu Met
690 695 700

Arg His Asp Lys Gln Lys Ile Pro Asn Gly Val Met Asp Phe Leu Phe
705 710 715 720

Leu Ser Leu Phe Ser Tyr Tyr Gln Glu Lys Gly Tyr His Tyr Phe Asp
725 730 735

Leu Gly Met Ala Pro Leu Ser Gly Val Gly Arg Val Glu Thr Ser Phe
740 745 750

Ala Lys Glu Arg Met Ala Tyr Leu Val Tyr His Phe Gly Ser His Phe
755 760 765

Tyr Ser Phe Asn Gly Leu His Lys Tyr Lys Lys Lys Phe Thr Pro Leu
770 775 780

Trp Ser Glu Arg Tyr Ile Ser Cys Ser Arg Ser Ser Trp Leu Ile Cys
785 790 795 800

Ala Ile Cys Ala Leu Leu Met Glu Asp Ser Lys Ile Lys Ile Val Lys
805 810 815

<210> 6
<211> 518
<212> PRT
<213> Streptococcus
<400> 6

Met Arg Ile Leu Gly Ala Phe Leu Ile Phe Ser Thr Ala Phe Phe Glu
1 5 10 15

Asn Ile Thr Tyr Ile Met Trp Leu Gln Lys Leu Gly Leu Asp Pro Leu
20 25 30

Gln Glu Gln Met Leu Trp Gln Phe Pro Gly Leu Leu Leu Gly Val Cys
35 40 45

Phe Ile Leu Leu Ala Arg Thr Ile Asp Gln Lys Val Lys Asn Ala Phe
50 55 60

Pro Ile Ala Ile Ile Trp Ile Thr Leu Thr Leu Phe Tyr Leu Asn Leu
65 70 75 80

Gly His Ile Ser Trp Arg Leu Ser Phe Trp Phe Ile Leu Leu Leu Leu
85 90 95

Gly Leu Leu Val Ile Lys Pro Thr Leu Tyr Lys Lys Gln Phe Ile Tyr
100 105 110

Ser Trp Glu Glu Arg Ile Lys Asp Gly Ile Ile Ile Val Ser Leu Met
115 120 125

Gly Val Leu Phe Tyr Ile Ala Gly Leu Leu Phe Pro Ile Arg Ala His
130 135 140

Ile Thr Gly Gly Ser Ile Glu Arg Leu His Tyr Ile Ile Ala Trp Glu
145 150 155 160

Pro Ile Ala Leu Ala Thr Leu Ile Leu Thr Leu Val Tyr Leu Cys Leu
165 170 175

Val Lys Ile Leu Gln Gly Lys Ser Cys Gln Ile Gly Asp Val Phe Asn
180 185 190

Val Asp Arg Tyr Lys Lys Leu Leu Gln Ala Tyr Gly Gly Ser Ser Asp
195 200 205

Ser Gly Leu Ala Phe Leu Asn Asp Lys Arg Leu Tyr Trp Tyr Gln Lys
210 215 220

Asn Gly Glu Asp Cys Val Ala Phe Gln Phe Val Ile Val Asn Asn Lys
225 230 235 240

Cys Leu Ile Met Gly Glu Pro Ala Gly Asp Asp Thr Tyr Ile Arg Glu
245 250 255

Ala Ile Glu Ser Phe Ile Asp Asp Ala Asp Lys Leu Asp Tyr Asp Leu
260 265 270

Val Phe Tyr Ser Ile Gly Gln Lys Leu Thr Leu Leu Leu His Glu Tyr
275 280 285

Gly Phe Asp Phe Met Lys Val Gly Glu Asp Ala Leu Val Asn Leu Glu
290 295 300

Thr Phe Thr Leu Lys Gly Asn Lys Tyr Lys Pro Phe Arg Asn Ala Leu
305 310 315 320

Asn Arg Val Glu Lys Asp Gly Phe Tyr Phe Glu Val Val Gln Ser Pro
325 330 335

His Ser Gln Glu Leu Leu Asn Ser Leu Glu Glu Ile Ser Asn Thr Trp
340 345 350

Leu Glu Gly Arg Pro Glu Lys Gly Phe Ser Leu Gly Tyr Phe Asn Lys
355 360 365

Asp Tyr Phe Gln Gln Ala Pro Ile Ala Leu Val Lys Asn Ala Glu His
370 375 380

Glu Val Val Ala Phe Ala Asn Ile Met Pro Asn Tyr Glu Lys Ser Ile
385 390 395 400

Ile Ser Ile Asp Leu Met Arg His Asp Lys Gln Lys Ile Pro Asn Gly
405 410 415

Val Met Asp Phe Leu Phe Leu Ser Leu Phe Ser Tyr Tyr Gln Glu Lys
420 425 430

Gly Tyr His Tyr Phe Asp Leu Gly Met Ala Pro Leu Ser Gly Val Gly
435 440 445

Arg Val Glu Thr Ser Phe Ala Lys Glu Arg Met Ala Tyr Leu Val Tyr
450 455 460

His Phe Gly Ser His Phe Tyr Ser Phe Asn Gly Leu His Lys Tyr Lys
465 470 475 480

Lys Lys Phe Thr Pro Leu Trp Ser Glu Arg Tyr Ile Ser Cys Ser Arg
485 490 495

Ser Ser Trp Leu Ile Cys Ala Ile Cys Ala Leu Leu Met Glu Asp Ser
500 505 510

Lys Ile Lys Ile Val Lys
515

<210> 7
<211> 5126
<212> DNA
<213> Streptococcus

<220>
<221> CDS
<222> (1)...(687)

<221> CDS
<222> (701)...(2557)

<221> CDS
<222> (2566)...(3036)

<221> CDS
<222> (3106)...(4842)

<221> CDS
<222> (4850)...(5125)

<400> 7

aat ttt gat atc gaa aca aca act ttt gag gca atg aaa aag cac gcg 48
Asn Phe Asp Ile Glu Thr Thr Thr Phe Glu Ala Met Lys Lys His Ala
1 5 10 15

tca tta ttg gag aaa ata tct gtt gag cgt tct ttt att gaa ttt gat 96
Ser Leu Leu Glu Lys Ile Ser Val Glu Arg Ser Phe Ile Glu Phe Asp
20 25 30

aaa ctt cta tta gca cct tat tgg cgt aaa gga atg ctg gca cta ata 144
Lys Leu Leu Leu Ala Pro Tyr Trp Arg Lys Gly Met Leu Ala Leu Ile
35 40 45

gat agt cat gct ttt aat tat cta cca tgc tta aaa aat agg gaa tta 192
Asp Ser His Ala Phe Asn Tyr Leu Pro Cys Leu Lys Asn Arg Glu Leu
50 55 60

caa tta agc gcc ttt ttg tcc cag tta gat aaa gat ttt tta ttt gag 240
Gln Leu Ser Ala Phe Leu Ser Gln Leu Asp Lys Asp Phe Leu Phe Glu
65 70 75 80

aca tca gaa caa gct tgg gca tca ctc atc ttg agt atg gaa gtt gaa 288
Thr Ser Glu Gln Ala Trp Ala Ser Leu Ile Leu Ser Met Glu Val Glu
85 90 95

cac aca aag act ttt tta aaa aaa tgg aag aca tca act cac ttt caa 336
His Thr Lys Thr Phe Leu Lys Lys Trp Lys Thr Ser Thr His Phe Gln
100 105 110

aaa gat gtt gag cat ata gtg gat gtt tat cgt att cgt gaa caa atg 384
Lys Asp Val Glu His Ile Val Asp Val Tyr Arg Ile Arg Glu Gln Met
115 120 125

gga ttg gct aaa gaa cat ctt tat cgt tat gga aaa act ata ata aaa 432
Gly Leu Ala Lys Glu His Leu Tyr Arg Tyr Gly Lys Thr Ile Ile Lys
130 135 140

caa gcg gaa ggt att cgc aaa gca aga ggc ttg atg gtt gat ttc gaa 480
Gln Ala Glu Gly Ile Arg Lys Ala Arg Gly Leu Met Val Asp Phe Glu
145 150 155 160

aaa ata gaa caa cta gat agt gag tta gca atc cat gat agg cat gag 528
Lys Ile Glu Gln Leu Asp Ser Glu Leu Ala Ile His Asp Arg His Glu
165 170 175

ata gtt gtc aat ggt ggc acc tta atc aag aaa tta gga ata aaa cct 576
Ile Val Val Asn Gly Gly Thr Leu Ile Lys Lys Leu Gly Ile Lys Pro
180 185 190

ggt cca cag atg gga gat att atc tct caa att gaa tta gcc att gtt 624
Gly Pro Gln Met Gly Asp Ile Ile Ser Gln Ile Glu Leu Ala Ile Val
195 200 205

tta gga caa ctg att aat gaa gaa gag gct att tta cat ttt gtt aag 672
Leu Gly Gln Leu Ile Asn Glu Glu Glu Ala Ile Leu His Phe Val Lys
210 215 220

cag tac ttg atg gat tagagaggat tat atg agc gat ttt tta gta gat 721
Gln Tyr Leu Met Asp Met Ser Asp Phe Leu Val Asp
225 230 235

gga ttg act aag tcg gtt ggt gat aag acg gtc ttt agt aat gtt tca 769
Gly Leu Thr Lys Ser Val Gly Asp Lys Thr Val Phe Ser Asn Val Ser
240 245 250

ttt atc atc cat agt tta gac cgt att ggg att att ggt gtc aat gga 817
Phe Ile Ile His Ser Leu Asp Arg Ile Gly Ile Ile Gly Val Asn Gly
255 260 265

act gga aag aca aca cta tta gat gtt att tcg ggt gaa tta ggt ttt 865
Thr Gly Lys Thr Thr Leu Leu Asp Val Ile Ser Gly Glu Leu Gly Phe
270 275 280

gat ggt gat cgt tcc cct ttt tca tca gct aat gat tat aag att gct 913
Asp Gly Asp Arg Ser Pro Phe Ser Ser Ala Asn Asp Tyr Lys Ile Ala
285 290 295 300

tat tta aaa caa gaa cca gac ttt gat gat tct cag aca att ttg gac 961
Tyr Leu Lys Gln Glu Pro Asp Phe Asp Asp Ser Gln Thr Ile Leu Asp
305 310 315






acc gta ctt tct tct gac tta aga gag atg gct tta att aaa gaa tat 1009
Thr Val Leu Ser Ser Asp Leu Arg Glu Met Ala Leu Ile Lys Glu Tyr
320 325 330

gaa tta ttg ctt aat cac tac gaa gaa agt aag caa tca cgt cta gag 1057
Glu Leu Leu Leu Asn His Tyr Glu Glu Ser Lys Gln Ser Arg Leu Glu
335 340 345

aaa gta atg gca gaa atg gat tct tta gat gct tgg tct att gag agc 1105
Lys Val Met Ala Glu Met Asp Ser Leu Asp Ala Trp Ser Ile Glu Ser
350 355 360

gaa gtc aaa aca gta tta tcc aaa tta ggt att act gat ttg cag ttg 1153
Glu Val Lys Thr Val Leu Ser Lys Leu Gly Ile Thr Asp Leu Gln Leu
365 370 375 380

tcg gtt ggt gaa tta tca gga gga tta cga aga cgt gtt caa tta gcg 1201
Ser Val Gly Glu Leu Ser Gly Gly Leu Arg Arg Arg Val Gln Leu Ala
385 390 395

caa gta tta tta aat gat gca gat tta ttg ctc tta gac gaa cct act 1249
Gln Val Leu Leu Asn Asp Ala Asp Leu Leu Leu Leu Asp Glu Pro Thr
400 405 410

aac cac tta gat att gac act att gca tgg tta acg aat ttt ttg aaa 1297
Asn His Leu Asp Ile Asp Thr Ile Ala Trp Leu Thr Asn Phe Leu Lys
415 420 425

aat agt aaa aag aca gtg ctt ttt ata act cat gat cgt tat ttt cta 1345
Asn Ser Lys Lys Thr Val Leu Phe Ile Thr His Asp Arg Tyr Phe Leu
430 435 440

gac aat gtt gca aca cgt att ttt gaa tta gat aag gca cag att aca 1393
Asp Asn Val Ala Thr Arg Ile Phe Glu Leu Asp Lys Ala Gln Ile Thr
445 450 455 460

gaa tat caa ggc aat tat cag gat tat gtc cga ctt cgt gca gaa caa 1441
Glu Tyr Gln Gly Asn Tyr Gln Asp Tyr Val Arg Leu Arg Ala Glu Gln
465 470 475

gac gag cgt gat gct gct agt tta cat aaa aag aaa cag ctt tat aaa 1489
Asp Glu Arg Asp Ala Ala Ser Leu His Lys Lys Lys Gln Leu Tyr Lys
480 485 490

cag gaa cta gct tgg atg cgt act cag cca caa gct cgt gca acg aaa 1537
Gln Glu Leu Ala Trp Met Arg Thr Gln Pro Gln Ala Arg Ala Thr Lys
495 500 505

caa cag gct cgt att aat cgt ttt caa aat cta aaa aac gat tta cac 1585
Gln Gln Ala Arg Ile Asn Arg Phe Gln Asn Leu Lys Asn Asp Leu His
510 515 520

caa aca agc gat aca agc gat ttg gaa atg aca ttt gaa aca agt cga 1633
Gln Thr Ser Asp Thr Ser Asp Leu Glu Met Thr Phe Glu Thr Ser Arg
525 530 535 540 
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att ggg aaa 
lie Gly Lys 



gat aaa tct 
Asp Lys Ser 



cgt att ggc 
Arg lie Gly 
575 

aat tta att 
Asn Leu lie 
590 

ggt gaa acg 
Gly Glu Thr 
605 

gat ggc tea 
Asp Gly Ser 



gtt aaa act 
Val Lys Thr 



ttt etc ttt 
Phe Leu Phe 
655 

ggt ggt gag 
Gly Gly Glu 
670 

cct aat gtg 
Pro Asn Val 
685 

aca tta act 
Thr Leu Thr 



att aca gtt 
He Thr Val 



att att gcg 
He He Ala 
735 

act gat tat 
Thr Asp Tyr 
750 



aag gtt att 
Lys Val He 
545 

ate ttg aaa 
He Leu Lys 
560 

ate gtt gga 
He Val Gly 



gtt caa gat 
Val Gin Asp 



ata cgt gta 
He Arg Val 
610 

aaa cgt gtt 
Lys Arg Val 
625 

agt gtc ggt 
Ser Val Gly 
640 

cca cgt teg 
Pro Arg Ser 



aaa aaa aga 
Lys Lys Arg 



tta eta ctt 
Leu Leu Leu 
690 

gtt ctt gaa 
Val Leu Glu 
705 

agt cac gat 
Ser His Asp 
720 

ttt gaa gat 
Phe Glu Asp 



tta gat gaa 
Leu Asp Glu 



aat ttt gaa 
Asn Phe Glu 



gac ttt aat 
Asp Phe Asn 
565 

gat aat ggt 
Asp Asn Gly 
580 

tta cag ccg 
Leu Gin Pro 
595 

ggt tac ttt 
Gly Tyr Phe 



att aat tat 
He Asn Tyr 



aca aca agt 
Thr Thr Ser 
645 

aca cat gga 
Thr His Gly 
660 

ctt tac ctt 
Leu Tyr Leu 
675 

gat gag ccg 
Asp Glu Pro 



aat ttt tta 
Asn Phe Leu 



cgt tac ttt 
Arg Tyr Phe 
725 

aac gat ate 
Asn Asp He 
740 

aaa gca ttt 
Lys Ala Phe 
755 



aat gtc tct 
Asn Val Ser 
550 

ttg tta att 
Leu Leu He 



gtt gga aag 
Val Gly Lys 



gat teg ggt 
Asp Ser Gly 
600 

tea caa caa 
Ser Gin Gin 
615 

ttg caa gag 
Leu Gin Glu 
630 

gtg aca gaa 
Val Thr Glu 



aca caa att 
Thr Gin He 



tta aaa ate 
Leu Lys He 
680 

aca aat gac 
Thr Asn Asp 
695 

caa ggc ttt 
Gin Gly Phe 
710 

tta gat aaa 
Leu Asp Lys 



cgt gaa ttt 
Arg Glu Phe 



aat gag caa 
Asn Glu Gin 
760 



ttt tct tac 
Phe Ser Tyr 
555 

caa aat aaa 
Gin Asn Lys 
570 

tea acc tta 
Ser Thr Leu 
585 

aat gtc tct 
Asn Val Ser 



ctt cat aat 
Leu His Asn 



gtt gca gat 
Val Ala Asp 
635 

eta ttg gaa 
Leu Leu Glu 
650 

gca aaa tta 
Ala Lys Leu 
665 

ctg att gaa 
Leu He Glu 



tta gat att 
Leu Asp He 



ggt ggt cct 
Gly Gly Pro 
715 

gtg get aat 
Val Ala Asn 
730 

ttt ggt aat 
Phe Gly Asn 
745 

aat aat gaa 
Asn Asn Glu 



cca 1681 
Pro 



gac 1729 
Asp 



ctt 1777 
Leu 



att 1825 
He 



atg 1873 

Met 

620 

gag 1921 
Glu 



caa 1969 
Gin 



tea 2017 
Ser 



aag 2065 
Lys 



get 2113 

Ala 

700 

gtg 2161 
Val 



aaa 2209 
Lys 



tat 2257 
Tyr 



gtt 2305 
Val 
atc agt aaa aaa gag agt acc aag aca agt cgt gaa aag caa agt cgt 2353
Ile Ser Lys Lys Glu Ser Thr Lys Thr Ser Arg Glu Lys Gln Ser Arg
765 770 775 780

aaa aga atg tct tac ttt gaa aaa caa gaa tgg gcg aca att gaa gac 2401
Lys Arg Met Ser Tyr Phe Glu Lys Gln Glu Trp Ala Thr Ile Glu Asp
785 790 795

gat att atg ata ttg gaa aat act atc act cgt ata gaa aat gat atg 2449
Asp Ile Met Ile Leu Glu Asn Thr Ile Thr Arg Ile Glu Asn Asp Met
800 805 810

caa aca tgt ggt agt gat ttt aca agg tta tct gat tta caa aag gaa 2497
Gln Thr Cys Gly Ser Asp Phe Thr Arg Leu Ser Asp Leu Gln Lys Glu
815 820 825

tta gat gca aaa aat gaa gca ctt cta gaa aag tat gac cgt tat gag 2545
Leu Asp Ala Lys Asn Glu Ala Leu Leu Glu Lys Tyr Asp Arg Tyr Glu
830 835 840

tac ctt agt gag ttagacac atg att atc cgt ccg att att aaa aat gat 2595
Tyr Leu Ser Glu Leu Asp Thr Met Ile Ile Arg Pro Ile Ile Lys Asn Asp
845 850 855 860

gac caa gca gtt gca caa tta att cga caa agt tta cgc gcc tat gat 2643
Asp Gln Ala Val Ala Gln Leu Ile Arg Gln Ser Leu Arg Ala Tyr Asp
865 870 875

tta gat aaa cct gat aca gca tat tca gac cct cac tta gat cat ttg 2691
Leu Asp Lys Pro Asp Thr Ala Tyr Ser Asp Pro His Leu Asp His Leu
880 885 890

acc tca tac tac gaa aaa ata gag aag tca gga ttc ttt gtc att gag 2739
Thr Ser Tyr Tyr Glu Lys Ile Glu Lys Ser Gly Phe Phe Val Ile Glu
895 900 905

gag aga gat gag att att ggc tgt ggc ggc ttt ggt ccg ctg aaa aat 2787
Glu Arg Asp Glu Ile Ile Gly Cys Gly Gly Phe Gly Pro Leu Lys Asn
910 915 920 925

cta att gca gag atg cag aag gtg tac att gca gaa cgt ttc cgt ggt 2835
Leu Ile Ala Glu Met Gln Lys Val Tyr Ile Ala Glu Arg Phe Arg Gly
930 935 940

aag ggg ctt gct act gat tta gtg aaa atg att gaa gta gaa gct cga 2883
Lys Gly Leu Ala Thr Asp Leu Val Lys Met Ile Glu Val Glu Ala Arg
945 950 955

aaa att ggg tat aga caa ctt tat tta gag aca gcc agt act ttg agt 2931
Lys Ile Gly Tyr Arg Gln Leu Tyr Leu Glu Thr Ala Ser Thr Leu Ser
960 965 970

agg gca act gcg gtt tat aag cat atg gga tat tgt gcc tta tcg caa 2979
Arg Ala Thr Ala Val Tyr Lys His Met Gly Tyr Cys Ala Leu Ser Gln
975 980 985
cca ata gca aat gat caa ggt cat aca gct atg gat att tgg atg att 3027
Pro Ile Ala Asn Asp Gln Gly His Thr Ala Met Asp Ile Trp Met Ile
990 995 1000 1005

aaa gat tta taagttgaaa gtggattagt gaacatggat taattatttt 3076
Lys Asp Leu

gagataagag gaaagaaaag gagacatat atg gca tat att tgg tct tat ttg 3129
Met Ala Tyr Ile Trp Ser Tyr Leu
1010 1015

aaa agg tac ccc aat tgg tta tgg ctt gat tta cta gga gct atg ctt 3177
Lys Arg Tyr Pro Asn Trp Leu Trp Leu Asp Leu Leu Gly Ala Met Leu
1020 1025 1030

ttt gtg acg gtt atc cta gga atg ccc aca gcc tta gcg ggt atg att 3225
Phe Val Thr Val Ile Leu Gly Met Pro Thr Ala Leu Ala Gly Met Ile
1035 1040 1045

gat aat ggc gtt aca aaa ggt gat cgg act gga gtt tat ctg tgg acg 3273
Asp Asn Gly Val Thr Lys Gly Asp Arg Thr Gly Val Tyr Leu Trp Thr
1050 1055 1060

ttc atc atg ttt ata ttt gtt gta cta ggt att att ggg cgt att acg 3321
Phe Ile Met Phe Ile Phe Val Val Leu Gly Ile Ile Gly Arg Ile Thr
1065 1070 1075 1080

atg gct tac gca tct agt cgc tta acg aca aca atg att aga gat atg 3369
Met Ala Tyr Ala Ser Ser Arg Leu Thr Thr Thr Met Ile Arg Asp Met
1085 1090 1095

cgt aat gat atg tat gct aag ctt caa gaa tac tcc cat cat gaa tat 3417
Arg Asn Asp Met Tyr Ala Lys Leu Gln Glu Tyr Ser His His Glu Tyr
1100 1105 1110

gaa cag ata ggt gta tct tca cta gtg aca cgt atg aca agc gat act 3465
Glu Gln Ile Gly Val Ser Ser Leu Val Thr Arg Met Thr Ser Asp Thr
1115 1120 1125

ttt gtt ttg atg caa ttt gct gaa atg tct tta cgt tta ggc cta gta 3513
Phe Val Leu Met Gln Phe Ala Glu Met Ser Leu Arg Leu Gly Leu Val
1130 1135 1140

act cct atg gta atg att ttt agc gtg gtt atg ata cta att acg agt 3561
Thr Pro Met Val Met Ile Phe Ser Val Val Met Ile Leu Ile Thr Ser
1145 1150 1155 1160

cca tct ttg gct tgg ctt gta gcg gtt gcg atg cct ctt ttg gta gga 3609
Pro Ser Leu Ala Trp Leu Val Ala Val Ala Met Pro Leu Leu Val Gly
1165 1170 1175

gtc gtt tta tat gta gct ata aaa aca aaa cct tta tct gaa aga caa 3657
Val Val Leu Tyr Val Ala Ile Lys Thr Lys Pro Leu Ser Glu Arg Gln
1180 1185 1190
cag act atg ctt gat aaa atc aat caa tat gtt cgt gaa aat tta aca 3705
Gln Thr Met Leu Asp Lys Ile Asn Gln Tyr Val Arg Glu Asn Leu Thr
1195 1200 1205

ggg tta cgc gtt gtt aga gcc ttt gca aga gag aat ttt caa tca caa 3753
Gly Leu Arg Val Val Arg Ala Phe Ala Arg Glu Asn Phe Gln Ser Gln
1210 1215 1220

aaa ttt caa gtc gct aac caa cgt tac aca gat act tca act ggt ctt 3801
Lys Phe Gln Val Ala Asn Gln Arg Tyr Thr Asp Thr Ser Thr Gly Leu
1225 1230 1235 1240

ttt aaa tta aca ggg cta aca gaa cca ctt ttc gtt caa att att att 3849
Phe Lys Leu Thr Gly Leu Thr Glu Pro Leu Phe Val Gln Ile Ile Ile
1245 1250 1255

gca atg att gtg gct atc gtt tgg ttt gct ttg gat ccc tta caa aga 3897
Ala Met Ile Val Ala Ile Val Trp Phe Ala Leu Asp Pro Leu Gln Arg
1260 1265 1270

ggt gct att aaa ata ggg gat tta gtt gct ttt atc gaa tat agc ttc 3945
Gly Ala Ile Lys Ile Gly Asp Leu Val Ala Phe Ile Glu Tyr Ser Phe
1275 1280 1285

cat gct ctc ttt tca ttt ttg cta ttt gcc aat ctt ttt act atg tat 3993
His Ala Leu Phe Ser Phe Leu Leu Phe Ala Asn Leu Phe Thr Met Tyr
1290 1295 1300

cct cgt atg gtg gta tca agc cat cgt att aga gag gtg atg gat atg 4041
Pro Arg Met Val Val Ser Ser His Arg Ile Arg Glu Val Met Asp Met
1305 1310 1315 1320

cca atc tct atc aat cct aat gcc gaa ggt gtt acg gat acg aaa ctt 4089
Pro Ile Ser Ile Asn Pro Asn Ala Glu Gly Val Thr Asp Thr Lys Leu
1325 1330 1335

aaa ggg cat tta gaa ttt gat aat gta aca ttc gct tat cca gga gaa 4137
Lys Gly His Leu Glu Phe Asp Asn Val Thr Phe Ala Tyr Pro Gly Glu
1340 1345 1350

aca gag agt ccc gtt ttg cat gat att tct ttt aaa gct aag cct gga 4185
Thr Glu Ser Pro Val Leu His Asp Ile Ser Phe Lys Ala Lys Pro Gly
1355 1360 1365

gaa aca att gct ttt att ggt tca aca ggt tca gga aaa tct tct ctt 4233
Glu Thr Ile Ala Phe Ile Gly Ser Thr Gly Ser Gly Lys Ser Ser Leu
1370 1375 1380

gtt aat ttg att cca cgt ttt tat gat gtg aca ctt gga aaa atc tta 4281
Val Asn Leu Ile Pro Arg Phe Tyr Asp Val Thr Leu Gly Lys Ile Leu
1385 1390 1395 1400

gta gat gga gtt gat gta aga gat tat aac ctt aaa tca ctt cgc caa 4329
Val Asp Gly Val Asp Val Arg Asp Tyr Asn Leu Lys Ser Leu Arg Gln
1405 1410 1415
aag att gga ttt atc ccc caa aaa gct ctt tta ttt aca ggg aca ata 4377
Lys Ile Gly Phe Ile Pro Gln Lys Ala Leu Leu Phe Thr Gly Thr Ile
1420 1425 1430

gga gag aat tta aaa tat gga aaa gct gat gct act att gat gat ctt 4425
Gly Glu Asn Leu Lys Tyr Gly Lys Ala Asp Ala Thr Ile Asp Asp Leu
1435 1440 1445

aga caa gcg gtt gat att tct caa gct aaa gag ttt att gag agt cac 4473
Arg Gln Ala Val Asp Ile Ser Gln Ala Lys Glu Phe Ile Glu Ser His
1450 1455 1460

caa gaa gcc ttt gaa acg cat tta gct gaa ggt ggg agc aat ctt tct 4521
Gln Glu Ala Phe Glu Thr His Leu Ala Glu Gly Gly Ser Asn Leu Ser
1465 1470 1475 1480

ggg ggt caa aaa caa cgg tta tct att gct agg gct gtt gtt aaa gat 4569
Gly Gly Gln Lys Gln Arg Leu Ser Ile Ala Arg Ala Val Val Lys Asp
1485 1490 1495

cca gat tta tat att ttt gat gat tca ttt tct gct ctc gat tat aag 4617
Pro Asp Leu Tyr Ile Phe Asp Asp Ser Phe Ser Ala Leu Asp Tyr Lys
1500 1505 1510

aca gac gct act tta aga gcg cgt cta aaa gaa gta acc ggt gat tct 4665
Thr Asp Ala Thr Leu Arg Ala Arg Leu Lys Glu Val Thr Gly Asp Ser
1515 1520 1525

aca gtt ttg ata gtt gct caa agg gtg ggt acg att atg gat gct gat 4713
Thr Val Leu Ile Val Ala Gln Arg Val Gly Thr Ile Met Asp Ala Asp
1530 1535 1540

cag att att gtc ctt gat gaa ggc gaa att gtc ggt cgt ggt acc cac 4761
Gln Ile Ile Val Leu Asp Glu Gly Glu Ile Val Gly Arg Gly Thr His
1545 1550 1555 1560

gct caa tta ata gaa aat aat gct att tat cgt gaa atc gct gag tca 4809
Ala Gln Leu Ile Glu Asn Asn Ala Ile Tyr Arg Glu Ile Ala Glu Ser
1565 1570 1575

caa ctg aag aac caa aac tta tca gaa gga gag tgattgt atg aga aaa 4858
Gln Leu Lys Asn Gln Asn Leu Ser Glu Gly Glu Met Arg Lys
1580 1585 1590

aaa tct gtt ttt ttg aga tta tgg tct tac cta act cgc tac aaa gct 4906
Lys Ser Val Phe Leu Arg Leu Trp Ser Tyr Leu Thr Arg Tyr Lys Ala
1595 1600 1605

act ctt ttc tta gcg att ttt ttg aaa gtt tta tct agt ttt atg agt 4954
Thr Leu Phe Leu Ala Ile Phe Leu Lys Val Leu Ser Ser Phe Met Ser
1610 1615 1620

gtt ctg gag cct ttt att tta ggg tta gcg ata aca gag ttg act gct 5002
Val Leu Glu Pro Phe Ile Leu Gly Leu Ala Ile Thr Glu Leu Thr Ala
1625 1630 1635
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aac 


ctt 


gtt 


gat 


atg 


get 


aag 




gtt 


tct 


ggg 
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ttg 
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Leu 


Asn 
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1640 
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C C t 


tat 


att 


get 


ggt 
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att 
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1 1 c 
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Tyr 


He 
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Gly 
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He 


He 


Tyr 
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ir lie. 


Arg 
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Ser 


Leu 


Leu 


Glu 


Lys 


He 


Ser 
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Glu 
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Phe 


He 
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Phe 
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20 
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Trp 
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He 
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He 
<400> 9

Met Ser Asp Phe Leu Val Asp Gly Leu Thr Lys Ser Val Gly Asp Lys
1 5 10 15

Thr Val Phe Ser Asn Val Ser Phe Ile Ile His Ser Leu Asp Arg Ile
20 25 30

Gly Ile Ile Gly Val Asn Gly Thr Gly Lys Thr Thr Leu Leu Asp Val
35 40 45

Ile Ser Gly Glu Leu Gly Phe Asp Gly Asp Arg Ser Pro Phe Ser Ser
50 55 60

Ala Asn Asp Tyr Lys Ile Ala Tyr Leu Lys Gln Glu Pro Asp Phe Asp
65 70 75 80

Asp Ser Gln Thr Ile Leu Asp Thr Val Leu Ser Ser Asp Leu Arg Glu
85 90 95

Met Ala Leu Ile Lys Glu Tyr Glu Leu Leu Leu Asn His Tyr Glu Glu
100 105 110

Ser Lys Gln Ser Arg Leu Glu Lys Val Met Ala Glu Met Asp Ser Leu
115 120 125

Asp Ala Trp Ser Ile Glu Ser Glu Val Lys Thr Val Leu Ser Lys Leu
130 135 140

Gly Ile Thr Asp Leu Gln Leu Ser Val Gly Glu Leu Ser Gly Gly Leu
145 150 155 160

Arg Arg Arg Val Gln Leu Ala Gln Val Leu Leu Asn Asp Ala Asp Leu
165 170 175

Leu Leu Leu Asp Glu Pro Thr Asn His Leu Asp Ile Asp Thr Ile Ala
180 185 190

Trp Leu Thr Asn Phe Leu Lys Asn Ser Lys Lys Thr Val Leu Phe Ile
195 200 205

Thr His Asp Arg Tyr Phe Leu Asp Asn Val Ala Thr Arg Ile Phe Glu
210 215 220

Leu Asp Lys Ala Gln Ile Thr Glu Tyr Gln Gly Asn Tyr Gln Asp Tyr
225 230 235 240

Val Arg Leu Arg Ala Glu Gln Asp Glu Arg Asp Ala Ala Ser Leu His
245 250 255

Lys Lys Lys Gln Leu Tyr Lys Gln Glu Leu Ala Trp Met Arg Thr Gln
260 265 270

Pro Gln Ala Arg Ala Thr Lys Gln Gln Ala Arg Ile Asn Arg Phe Gln
275 280 285

Asn Leu Lys Asn Asp Leu His Gln Thr Ser Asp Thr Ser Asp Leu Glu
290 295 300

Met Thr Phe Glu Thr Ser Arg Ile Gly Lys Lys Val Ile Asn Phe Glu
305 310 315 320

Asn Val Ser Phe Ser Tyr Pro Asp Lys Ser Ile Leu Lys Asp Phe Asn
325 330 335

Leu Leu Ile Gln Asn Lys Asp Arg Ile Gly Ile Val Gly Asp Asn Gly
340 345 350

Val Gly Lys Ser Thr Leu Leu Asn Leu Ile Val Gln Asp Leu Gln Pro
355 360 365

Asp Ser Gly Asn Val Ser Ile Gly Glu Thr Ile Arg Val Gly Tyr Phe
370 375 380

Ser Gln Gln Leu His Asn Met Asp Gly Ser Lys Arg Val Ile Asn Tyr
385 390 395 400

Leu Gln Glu Val Ala Asp Glu Val Lys Thr Ser Val Gly Thr Thr Ser
405 410 415

Val Thr Glu Leu Leu Glu Gln Phe Leu Phe Pro Arg Ser Thr His Gly
420 425 430
Thr Gln Ile Ala Lys Leu Ser Gly Gly Glu Lys Lys Arg Leu Tyr Leu
435 440 445

Leu Lys Ile Leu Ile Glu Lys Pro Asn Val Leu Leu Leu Asp Glu Pro
450 455 460

Thr Asn Asp Leu Asp Ile Ala Thr Leu Thr Val Leu Glu Asn Phe Leu
465 470 475 480

Gln Gly Phe Gly Gly Pro Val Ile Thr Val Ser His Asp Arg Tyr Phe
485 490 495

Leu Asp Lys Val Ala Asn Lys Ile Ile Ala Phe Glu Asp Asn Asp Ile
500 505 510

Arg Glu Phe Phe Gly Asn Tyr Thr Asp Tyr Leu Asp Glu Lys Ala Phe
515 520 525

Asn Glu Gln Asn Asn Glu Val Ile Ser Lys Lys Glu Ser Thr Lys Thr
530 535 540

Ser Arg Glu Lys Gln Ser Arg Lys Arg Met Ser Tyr Phe Glu Lys Gln
545 550 555 560

Glu Trp Ala Thr Ile Glu Asp Asp Ile Met Ile Leu Glu Asn Thr Ile
565 570 575

Thr Arg Ile Glu Asn Asp Met Gln Thr Cys Gly Ser Asp Phe Thr Arg
580 585 590

Leu Ser Asp Leu Gln Lys Glu Leu Asp Ala Lys Asn Glu Ala Leu Leu
595 600 605

Glu Lys Tyr Asp Arg Tyr Glu Tyr Leu Ser Glu Leu Asp Thr
610 615 620

<210> 10 
<211> 157 
<212> PRT 

<213> Streptococcus 
<400> 10 

Met Ile Ile Arg Pro Ile Ile Lys Asn Asp Asp Gln Ala Val Ala Gln
1 5 10 15

Leu Ile Arg Gln Ser Leu Arg Ala Tyr Asp Leu Asp Lys Pro Asp Thr
20 25 30

Ala Tyr Ser Asp Pro His Leu Asp His Leu Thr Ser Tyr Tyr Glu Lys
35 40 45

Ile Glu Lys Ser Gly Phe Phe Val Ile Glu Glu Arg Asp Glu Ile Ile
50 55 60

Gly Cys Gly Gly Phe Gly Pro Leu Lys Asn Leu Ile Ala Glu Met Gln
65 70 75 80

Lys Val Tyr Ile Ala Glu Arg Phe Arg Gly Lys Gly Leu Ala Thr Asp
85 90 95

Leu Val Lys Met Ile Glu Val Glu Ala Arg Lys Ile Gly Tyr Arg Gln
100 105 110

Leu Tyr Leu Glu Thr Ala Ser Thr Leu Ser Arg Ala Thr Ala Val Tyr
115 120 125

Lys His Met Gly Tyr Cys Ala Leu Ser Gln Pro Ile Ala Asn Asp Gln
130 135 140

Gly His Thr Ala Met Asp Ile Trp Met Ile Lys Asp Leu
145 150 155

<210> 11 

<211> 579 

<212> PRT 

<213> Streptococcus 
<400> 11

Met Ala Tyr Ile Trp Ser Tyr Leu Lys Arg Tyr Pro Asn Trp Leu Trp
1 5 10 15

Leu Asp Leu Leu Gly Ala Met Leu Phe Val Thr Val Ile Leu Gly Met
20 25 30

Pro Thr Ala Leu Ala Gly Met Ile Asp Asn Gly Val Thr Lys Gly Asp
35 40 45

Arg Thr Gly Val Tyr Leu Trp Thr Phe Ile Met Phe Ile Phe Val Val
50 55 60

Leu Gly Ile Ile Gly Arg Ile Thr Met Ala Tyr Ala Ser Ser Arg Leu
65 70 75 80

Thr Thr Thr Met Ile Arg Asp Met Arg Asn Asp Met Tyr Ala Lys Leu
85 90 95

Gln Glu Tyr Ser His His Glu Tyr Glu Gln Ile Gly Val Ser Ser Leu
100 105 110

Val Thr Arg Met Thr Ser Asp Thr Phe Val Leu Met Gln Phe Ala Glu
115 120 125

Met Ser Leu Arg Leu Gly Leu Val Thr Pro Met Val Met Ile Phe Ser
130 135 140

Val Val Met Ile Leu Ile Thr Ser Pro Ser Leu Ala Trp Leu Val Ala
145 150 155 160

Val Ala Met Pro Leu Leu Val Gly Val Val Leu Tyr Val Ala Ile Lys
165 170 175

Thr Lys Pro Leu Ser Glu Arg Gln Gln Thr Met Leu Asp Lys Ile Asn
180 185 190

Gln Tyr Val Arg Glu Asn Leu Thr Gly Leu Arg Val Val Arg Ala Phe
195 200 205

Ala Arg Glu Asn Phe Gln Ser Gln Lys Phe Gln Val Ala Asn Gln Arg
210 215 220

Tyr Thr Asp Thr Ser Thr Gly Leu Phe Lys Leu Thr Gly Leu Thr Glu
225 230 235 240

Pro Leu Phe Val Gln Ile Ile Ile Ala Met Ile Val Ala Ile Val Trp
245 250 255

Phe Ala Leu Asp Pro Leu Gln Arg Gly Ala Ile Lys Ile Gly Asp Leu
260 265 270

Val Ala Phe Ile Glu Tyr Ser Phe His Ala Leu Phe Ser Phe Leu Leu
275 280 285

Phe Ala Asn Leu Phe Thr Met Tyr Pro Arg Met Val Val Ser Ser His
290 295 300

Arg Ile Arg Glu Val Met Asp Met Pro Ile Ser Ile Asn Pro Asn Ala
305 310 315 320

Glu Gly Val Thr Asp Thr Lys Leu Lys Gly His Leu Glu Phe Asp Asn
325 330 335

Val Thr Phe Ala Tyr Pro Gly Glu Thr Glu Ser Pro Val Leu His Asp
340 345 350

Ile Ser Phe Lys Ala Lys Pro Gly Glu Thr Ile Ala Phe Ile Gly Ser
355 360 365

Thr Gly Ser Gly Lys Ser Ser Leu Val Asn Leu Ile Pro Arg Phe Tyr
370 375 380

Asp Val Thr Leu Gly Lys Ile Leu Val Asp Gly Val Asp Val Arg Asp
385 390 395 400

Tyr Asn Leu Lys Ser Leu Arg Gln Lys Ile Gly Phe Ile Pro Gln Lys
405 410 415

Ala Leu Leu Phe Thr Gly Thr Ile Gly Glu Asn Leu Lys Tyr Gly Lys
420 425 430

Ala Asp Ala Thr Ile Asp Asp Leu Arg Gln Ala Val Asp Ile Ser Gln
435 440 445

Ala Lys Glu Phe Ile Glu Ser His Gln Glu Ala Phe Glu Thr His Leu
450 455 460

Ala Glu Gly Gly Ser Asn Leu Ser Gly Gly Gln Lys Gln Arg Leu Ser
465 470 475 480

Ile Ala Arg Ala Val Val Lys Asp Pro Asp Leu Tyr Ile Phe Asp Asp
485 490 495

Ser Phe Ser Ala Leu Asp Tyr Lys Thr Asp Ala Thr Leu Arg Ala Arg
500 505 510

Leu Lys Glu Val Thr Gly Asp Ser Thr Val Leu Ile Val Ala Gln Arg
515 520 525

Val Gly Thr Ile Met Asp Ala Asp Gln Ile Ile Val Leu Asp Glu Gly
530 535 540

Glu Ile Val Gly Arg Gly Thr His Ala Gln Leu Ile Glu Asn Asn Ala
545 550 555 560

Ile Tyr Arg Glu Ile Ala Glu Ser Gln Leu Lys Asn Gln Asn Leu Ser
565 570 575

Glu Gly Glu

<210> 12
<211> 92
<212> PRT
<213> Streptococcus
<400> 12

Met Arg Lys Lys Ser Val Phe Leu Arg Leu Trp Ser Tyr Leu Thr Arg
1 5 10 15

Tyr Lys Ala Thr Leu Phe Leu Ala Ile Phe Leu Lys Val Leu Ser Ser
20 25 30

Phe Met Ser Val Leu Glu Pro Phe Ile Leu Gly Leu Ala Ile Thr Glu
35 40 45

Leu Thr Ala Asn Leu Val Asp Met Ala Lys Gly Val Ser Gly Ala Glu
50 55 60

Leu Asn Val Pro Tyr Ile Ala Gly Ile Leu Ile Ile Tyr Phe Phe Arg
65 70 75 80

Gly Val Phe Tyr Glu Leu Gly Ser Tyr Gly Ser Asn
85 90



<210> 13
<211> 5215
<212> DNA
<213> Streptococcus

<220>
<221> CDS
<222> (3)...(122)

<221> CDS
<222> (133)...(2511)

<221> CDS
<222> (367)...(2511)

<221> CDS
<222> (2946)...(2716)
<223> of complementary strand

<221> CDS
<222> (3252)...(2995)
<223> of complementary strand

<221> CDS
<222> (3676)...(3299)
<223> of complementary strand

<221> CDS
<222> (4124)...(3837)
<223> of complementary strand

<221> CDS
<222> (5214)...(4351)
<223> of complementary strand

<400> 13

aa ttt gga agt gct cta tca aca gtt gaa gta aag gag att att agt 47
Phe Gly Ser Ala Leu Ser Thr Val Glu Val Lys Glu Ile Ile Ser
1 5 10 15

gaa gaa aac ata tgg tta tat cgg ctc agt tgc tgc cat ttt act agc 95
Glu Glu Asn Ile Trp Leu Tyr Arg Leu Ser Cys Cys His Phe Thr Ser
20 25 30

tac tca tat tgg aag tta cca act tgg taagcatcat atg ggt cta gca 144
Tyr Ser Tyr Trp Lys Leu Pro Thr Trp Met Gly Leu Ala
35 40

aca aag gac aat cag att gcc tat att gat gac agc aaa ggt aag gca 192
Thr Lys Asp Asn Gln Ile Ala Tyr Ile Asp Asp Ser Lys Gly Lys Ala
45 50 55 60

aaa gcc cct aaa aca aac aaa acg atg gat caa atc agt gct gaa gaa 240
Lys Ala Pro Lys Thr Asn Lys Thr Met Asp Gln Ile Ser Ala Glu Glu
65 70 75

ggc atc tct gct gaa cag atc gta gtc aaa att act gac caa ggc tat 288
Gly Ile Ser Ala Glu Gln Ile Val Val Lys Ile Thr Asp Gln Gly Tyr
80 85 90

gtg acc tca cac ggt gac cat tat cat ttt tac aat ggg aaa gtt cct 336
Val Thr Ser His Gly Asp His Tyr His Phe Tyr Asn Gly Lys Val Pro
95 100 105

tat gat gcg att att agt gaa gag ttg ttg atg acg gat cct aat tac 384
Tyr Asp Ala Ile Ile Ser Glu Glu Leu Leu Met Thr Asp Pro Asn Tyr
110 115 120

cgt ttt aaa caa tca gac gtt atc aat gaa atc tta gac ggt tac gtt 432
Arg Phe Lys Gln Ser Asp Val Ile Asn Glu Ile Leu Asp Gly Tyr Val
125 130 135 140

att aaa gtc aat ggc aac tat tat gtt tac ctc aag cca ggt agt aag 480
Ile Lys Val Asn Gly Asn Tyr Tyr Val Tyr Leu Lys Pro Gly Ser Lys
145 150 155

cgc aaa aac att cga acc aaa caa caa att gct gag caa gta gcc aaa 528
Arg Lys Asn Ile Arg Thr Lys Gln Gln Ile Ala Glu Gln Val Ala Lys
160 165 170

gga act aaa gaa gct aaa gaa aaa ggt tta gct caa gtg gcc cat ctc 576
Gly Thr Lys Glu Ala Lys Glu Lys Gly Leu Ala Gln Val Ala His Leu
175 180 185

agt aaa gaa gaa gtt gcg gca gtc aat gaa gca aaa aga caa gga cgc 624
Ser Lys Glu Glu Val Ala Ala Val Asn Glu Ala Lys Arg Gln Gly Arg
190 195 200

tat act aca gac gat ggc tat att ttt agt ccg aca gat atc att gat 672
Tyr Thr Thr Asp Asp Gly Tyr Ile Phe Ser Pro Thr Asp Ile Ile Asp
205 210 215 220

gat tta gga gat gct tat tta gta cct cat ggt aat cac tat cat tat 720
Asp Leu Gly Asp Ala Tyr Leu Val Pro His Gly Asn His Tyr His Tyr
225 230 235

att cct aaa aag gat ttg tct cca agt gag cta gct gct gca caa gcc 768
Ile Pro Lys Lys Asp Leu Ser Pro Ser Glu Leu Ala Ala Ala Gln Ala
240 245 250

tac tgg agt caa aaa caa ggt cga ggt gct aga ccg tct gat tac cgc 816
Tyr Trp Ser Gln Lys Gln Gly Arg Gly Ala Arg Pro Ser Asp Tyr Arg
255 260 265

ccg aca cca gcc cca ggt cgt agg aaa gcc cca att cct gat gtg acg 864
Pro Thr Pro Ala Pro Gly Arg Arg Lys Ala Pro Ile Pro Asp Val Thr
270 275 280

cct aac cct gga caa ggt cat cag cca gat aac ggt ggc tat cat cca 912
Pro Asn Pro Gly Gln Gly His Gln Pro Asp Asn Gly Gly Tyr His Pro
285 290 295 300

gcg cct cct agg cca aat gat gcg tca caa aac aaa cac caa aga gat 960
Ala Pro Pro Arg Pro Asn Asp Ala Ser Gln Asn Lys His Gln Arg Asp
305 310 315

gag ttt aaa gga aaa acc ttt aag gaa ctt tta gat caa cta cac cgt 1008
Glu Phe Lys Gly Lys Thr Phe Lys Glu Leu Leu Asp Gln Leu His Arg
320 325 330

ctt gat ttg aaa tac cgt cat gtg gaa gaa gat ggg ttg att ttt gaa 1056
Leu Asp Leu Lys Tyr Arg His Val Glu Glu Asp Gly Leu Ile Phe Glu
335 340 345

ccg act caa gtg atc aaa tca aac gct ttt ggg tat gtg gtg cct cat 1104
Pro Thr Gln Val Ile Lys Ser Asn Ala Phe Gly Tyr Val Val Pro His
350 355 360

gga gat cat tat cat att atc cca aga agt cag tta tca cct ctt gaa 1152
Gly Asp His Tyr His Ile Ile Pro Arg Ser Gln Leu Ser Pro Leu Glu
365 370 375 380

atg gaa tta gca gat cga tac tta gct ggc caa act gag gac aat gac 1200
Met Glu Leu Ala Asp Arg Tyr Leu Ala Gly Gln Thr Glu Asp Asn Asp
385 390 395

tca ggt tca gag cac tca aaa cca tca gat aaa gaa gtg aca cat acc 1248
Ser Gly Ser Glu His Ser Lys Pro Ser Asp Lys Glu Val Thr His Thr
400 405 410

ttt ctt ggt cat cgc atc aaa gct tac gga aaa ggc tta gat ggt aaa 1296
Phe Leu Gly His Arg Ile Lys Ala Tyr Gly Lys Gly Leu Asp Gly Lys
415 420 425

cca tat gat acg agt gat gct tat gtt ttt agt aaa gaa tcc att cat 1344
Pro Tyr Asp Thr Ser Asp Ala Tyr Val Phe Ser Lys Glu Ser Ile His
430 435 440

tca gtg gat aaa tca gga gtt aca gct aaa cac gga gat cat ttc cac 1392
Ser Val Asp Lys Ser Gly Val Thr Ala Lys His Gly Asp His Phe His
445 450 455 460

tat ata gga ttt gga gaa ctt gaa caa tat gag ttg gat gag gtc gct 1440
Tyr Ile Gly Phe Gly Glu Leu Glu Gln Tyr Glu Leu Asp Glu Val Ala
465 470 475

aac tgg gtg aaa gca aaa ggt caa gct gat gag ctt gct gct gct ttg 1488
Asn Trp Val Lys Ala Lys Gly Gln Ala Asp Glu Leu Ala Ala Ala Leu
480 485 490

gat cag gaa caa ggc aaa gaa aaa cca ctc ttt gac act aaa aaa gtg 1536
Asp Gln Glu Gln Gly Lys Glu Lys Pro Leu Phe Asp Thr Lys Lys Val
495 500 505

agt cgc aaa gta aca aaa gat ggt aaa gtg ggc tat atg atg cca aaa 1584
Ser Arg Lys Val Thr Lys Asp Gly Lys Val Gly Tyr Met Met Pro Lys
510 515 520

gat ggt aag gac tat ttc tat gct cgt gat caa ctt gat ttg act cag 1632
Asp Gly Lys Asp Tyr Phe Tyr Ala Arg Asp Gln Leu Asp Leu Thr Gln
525 530 535 540

att gcc ttt gcc gaa caa gaa cta atg ctt aaa gat aag aag cat tac 1680
Ile Ala Phe Ala Glu Gln Glu Leu Met Leu Lys Asp Lys Lys His Tyr
545 550 555

cgt tat gac att gtt gac aca ggt att gag cca cga ctt gct gta gat 1728
Arg Tyr Asp Ile Val Asp Thr Gly Ile Glu Pro Arg Leu Ala Val Asp
560 565 570

gtg tca agt ctg ccg atg cat gct ggt aat gct act tac gat act gga 1776
Val Ser Ser Leu Pro Met His Ala Gly Asn Ala Thr Tyr Asp Thr Gly
575 580 585

agt tcg ttt gtt atc cca cat att gat cat atc cat gtc gtt ccg tat 1824
Ser Ser Phe Val Ile Pro His Ile Asp His Ile His Val Val Pro Tyr
590 595 600

tca tgg ttg acg cgc gat cag att gca aca gtc aag tat gtg atg caa 1872
Ser Trp Leu Thr Arg Asp Gln Ile Ala Thr Val Lys Tyr Val Met Gln
605 610 615 620

cac ccc gaa gtt cgt ccg gat gta tgg tct aag cca ggg cat gaa gag 1920
His Pro Glu Val Arg Pro Asp Val Trp Ser Lys Pro Gly His Glu Glu
625 630 635

tca ggt tcg gtc att cca aat gtt acg cct ctt gat aaa cgt gct ggt 1968
Ser Gly Ser Val Ile Pro Asn Val Thr Pro Leu Asp Lys Arg Ala Gly
640 645 650

atg cca aac tgg caa att atc cat tct gct gaa gaa gtt caa aaa gcc 2016
Met Pro Asn Trp Gln Ile Ile His Ser Ala Glu Glu Val Gln Lys Ala
655 660 665

cta gca gaa ggt cgt ttt gca aca cca gac ggc tat att ttc gat cca 2064
Leu Ala Glu Gly Arg Phe Ala Thr Pro Asp Gly Tyr Ile Phe Asp Pro
670 675 680

cga gat gtt ttg gcc aaa gaa act ttt gta tgg aaa gat ggc tcc ttt 2112
Arg Asp Val Leu Ala Lys Glu Thr Phe Val Trp Lys Asp Gly Ser Phe
685 690 695 700

agc atc cca aga gca gat ggc agt tca ttg aga acc att aat aaa tct 2160
Ser Ile Pro Arg Ala Asp Gly Ser Ser Leu Arg Thr Ile Asn Lys Ser
705 710 715

gat cta tcc caa gct gag tgg caa caa gct caa gag tta ttg gca aag 2208
Asp Leu Ser Gln Ala Glu Trp Gln Gln Ala Gln Glu Leu Leu Ala Lys
720 725 730

aaa aat act ggt gat gct act gat acg gat aaa ccc aaa gaa aag caa 2256
Lys Asn Thr Gly Asp Ala Thr Asp Thr Asp Lys Pro Lys Glu Lys Gln
735 740 745

cag gca gat aag agc aat gaa aac caa cag cca agt gaa gcc agt aaa 2304
Gln Ala Asp Lys Ser Asn Glu Asn Gln Gln Pro Ser Glu Ala Ser Lys
750 755 760

gaa gaa aaa gaa tca gat gac ttt ata gac agt tta cca gac tat ggt 2352
Glu Glu Lys Glu Ser Asp Asp Phe Ile Asp Ser Leu Pro Asp Tyr Gly
765 770 775 780

cta gat aga gca acc cta gaa gat cat atc aat caa tta gca caa aaa 2400
Leu Asp Arg Ala Thr Leu Glu Asp His Ile Asn Gln Leu Ala Gln Lys
785 790 795

gct aat atc gat cct aag tat ctc att ttc caa cca gaa ggt gtc caa 2448
Ala Asn Ile Asp Pro Lys Tyr Leu Ile Phe Gln Pro Glu Gly Val Gln
800 805 810

ttt tat aat aaa aat ggt gaa ttg gta act tat gat atc aag aca ctt 2496
Phe Tyr Asn Lys Asn Gly Glu Leu Val Thr Tyr Asp Ile Lys Thr Leu
815 820 825

caa caa ata aac cct taaccaaaag aagatctcat tgttaaagca ctgctttgtc 2551
Gln Gln Ile Asn Pro
830


aaagcaagtt acggtgattt tgaagtcatt ctatgtaacg agtagtgata aaagttggat 2611

aatagcggtt ttcttttgca aagaaatggt atccatgtta gaatagtaaa aaaagaggag 2671

gattcttgga ctaatgtcaa ataagtagac agaaaactgt gttattttattgcgt 2726

taaaataatt ttcttctttc tgattagggg ttagtcctag attagccgta tgtgggttgt 2786

aattgttata aaaattctca atgtattcaa agcagtctaa ttgaacctgt ttgatatttt 2846

gataatgttt tcggttgatt tgtctatgct ttaaatactt gaaaaatgct tcagttacgg 2906

cattatcata aggatatcca ggattagaaa aagaatgcat gatattggca ctgcacccta 2966

atagtgagac gcaagaaaaa cacttttaggcaat cagtt ttctgtactg tacaggcgac 3025

tggtcgttta atctctgttg aattctagtt tcattataaa atgtaatgta atttttaaca 3085

atatttgtta tactatcttt gttgtatttt ctcctattat ggaaataaaa ggtttcagtc 3145

tttaggacgg tgtgaaacca ttcaatacag gcattatctg caggtgttcc ttttcgagac 3205

attgagcgga taatgtcttt ttccgtgcaa gcctggtagt aagccataga agtatacact 3265

gagccttggt cactgtgtaa gattgctcct ttatttaggcaatt ttaactgatt 3319

aagggtgtct agtacaaaat ccgtgtcctg acaatctgag atagtgtaag ctataatttc 3379

tcggttatag agattcataa ttgatgagag atacaattta cagttaccga aatataggta 3439

ggtaatatct gttacgagct tttccttagg cttatcggca tggaaatccc gactcaattt 3499

attatctgtt aaataataag ctttacccaa attgggaact ttcttggtac gtgtccgaca 3559

aagccagcca ttatttttca tgatacgata gactttcttt gtattaacag tcaatccgtg 3619

gatttttttg agcaatcgtg taatggtacg atagccataa ataaagtgat tctccataca 3679

gagctgttca attaattcaa taaggtcatc tttttttgcg gcttctcata ctcctttttc 3739

caacggtaat aggtcgaccg cttgacctta aaacagtcta gaatgaaaac tatcgggtag 3799

ttgtttttat agtcttccac aagcttgata agact tactttatcgatt tccttatcaa 3857

gcctcgatac ttttttaaga ggtcaacctg taattgtaat tgttccactt cagacagatg 3917

ttccaagcct ttaccgtagg tatattgctt gccaacacct tgatgaaaac gataaagctc 3977

ctcgttttcg taccatttca tccaagtata gatttgacta ttatttttga tgcctaaagt 4037

ctccataata actctgttag acttgcctgc tttcttcata tcgatgcaag ccagcttagt 4097

ttcccatgaa tatgcttttt taaccataat aaaacattcc tgtttctagt ttactaaatt 4157

tcaacaggag tgtttttctt ttgtctcatt ttagggattc agtgcctatt gttgtcatca 4217

attatttttc taaattcccc ggacttaaat tgtgaccctt ggtcggaatg aaagagaagt 4277

gttccttcaa tctttctttt attaagtgaa aaggcaacac ttttctgtac aacatttata 4337

aagtgttttt ctaggcaattaatc ttttagtcat tggtgtttgg tagttgagac 4391

taccatgaat gcggtggtaa ttccaccaat gaacatagtc tttagtctta agagctagtt 4451

cttccagcaa ttgaaaggtt tcttgataaa caaattcaat tttgaaagca cgatacgtac 4511

tttcagctac ggcattgtca taaggataac cagcctgact aagcgaacgt gtgattccaa 4571

aggcttccaa tatttcatca attaactgat tatcaaactc tttgccacga tctgaatgga 4631

acatcttgac tttggtcagg gcgtaaggga tgctttgtat ggcttgctta acgagttcag 4691

cggtcttgtg ccaaccaaga gacaggccga tgatttcacg gttgtatagg tcaatgatga 4751

ggcaaacata agcccaacga ttgcctacac gaacataggt taagtcagtg actaaggctt 4811

gtagtggtct ttcttgctta aattgcctgt ctaagtggtt gggaataggg gcttcattct 4871

tgcctctaga atgtggtttg aaggtggctt tctgataaac agaaaccaaa ttgagtcgct 4931

tcataatgcg tcgaatccga cgacgtgaaa gtgtgatacc ttcgttattc aagcatattt 4991

tgatttttct ggatccgtat ctagactcgc tatcgagaaa aattctttta atagtttctt 5051

caaactccgt ttcagatact gactccacgg cttgatagta ataacttgag tgtggcatat 5111

tcagccagcg acacatcttt gaaatgctgt atttatcctt attagcagtg attatttccc 5171

tttttgtgcc ataatcaccg ctgcttgctt taggatatct aatt 5215

<210> 14 

<211> 40 

<212> PRT 
<213> Streptococcus
<400> 14

Phe Gly Ser Ala Leu Ser Thr Val Glu Val Lys Glu Ile Ile Ser Glu
1 5 10 15

Glu Asn Ile Trp Leu Tyr Arg Leu Ser Cys Cys His Phe Thr Ser Tyr
20 25 30

Ser Tyr Trp Lys Leu Pro Thr Trp
35 40

<210> 15 
<211> 793 
<212> PRT 

<213> Streptococcus 
<400> 15 



Met Gly Leu Ala Thr Lys Asp Asn Gln Ile Ala Tyr Ile Asp Asp Ser
1 5 10 15

Lys Gly Lys Ala Lys Ala Pro Lys Thr Asn Lys Thr Met Asp Gln Ile
20 25 30

Ser Ala Glu Glu Gly Ile Ser Ala Glu Gln Ile Val Val Lys Ile Thr
35 40 45

Asp Gln Gly Tyr Val Thr Ser His Gly Asp His Tyr His Phe Tyr Asn
50 55 60

Gly Lys Val Pro Tyr Asp Ala Ile Ile Ser Glu Glu Leu Leu Met Thr
65 70 75 80

Asp Pro Asn Tyr Arg Phe Lys Gln Ser Asp Val Ile Asn Glu Ile Leu
85 90 95

Asp Gly Tyr Val Ile Lys Val Asn Gly Asn Tyr Tyr Val Tyr Leu Lys
100 105 110

Pro Gly Ser Lys Arg Lys Asn Ile Arg Thr Lys Gln Gln Ile Ala Glu
115 120 125

Gln Val Ala Lys Gly Thr Lys Glu Ala Lys Glu Lys Gly Leu Ala Gln
130 135 140

Val Ala His Leu Ser Lys Glu Glu Val Ala Ala Val Asn Glu Ala Lys
145 150 155 160

Arg Gln Gly Arg Tyr Thr Thr Asp Asp Gly Tyr Ile Phe Ser Pro Thr
165 170 175

Asp Ile Ile Asp Asp Leu Gly Asp Ala Tyr Leu Val Pro His Gly Asn
180 185 190

His Tyr His Tyr Ile Pro Lys Lys Asp Leu Ser Pro Ser Glu Leu Ala
195 200 205

Ala Ala Gln Ala Tyr Trp Ser Gln Lys Gln Gly Arg Gly Ala Arg Pro
210 215 220

Ser Asp Tyr Arg Pro Thr Pro Ala Pro Gly Arg Arg Lys Ala Pro Ile
225 230 235 240

Pro Asp Val Thr Pro Asn Pro Gly Gln Gly His Gln Pro Asp Asn Gly
245 250 255

Gly Tyr His Pro Ala Pro Pro Arg Pro Asn Asp Ala Ser Gln Asn Lys
260 265 270

His Gln Arg Asp Glu Phe Lys Gly Lys Thr Phe Lys Glu Leu Leu Asp
275 280 285

Gln Leu His Arg Leu Asp Leu Lys Tyr Arg His Val Glu Glu Asp Gly
290 295 300

Leu Ile Phe Glu Pro Thr Gln Val Ile Lys Ser Asn Ala Phe Gly Tyr
305 310 315 320

Val Val Pro His Gly Asp His Tyr His Ile Ile Pro Arg Ser Gln Leu
325 330 335

Ser Pro Leu Glu Met Glu Leu Ala Asp Arg Tyr Leu Ala Gly Gln Thr
340 345 350

Glu Asp Asn Asp Ser Gly Ser Glu His Ser Lys Pro Ser Asp Lys Glu
355 360 365

Val Thr His Thr Phe Leu Gly His Arg Ile Lys Ala Tyr Gly Lys Gly
370 375 380

Leu Asp Gly Lys Pro Tyr Asp Thr Ser Asp Ala Tyr Val Phe Ser Lys
385 390 395 400

Glu Ser Ile His Ser Val Asp Lys Ser Gly Val Thr Ala Lys His Gly
405 410 415

Asp His Phe His Tyr Ile Gly Phe Gly Glu Leu Glu Gln Tyr Glu Leu
420 425 430

Asp Glu Val Ala Asn Trp Val Lys Ala Lys Gly Gln Ala Asp Glu Leu
435 440 445

Ala Ala Ala Leu Asp Gln Glu Gln Gly Lys Glu Lys Pro Leu Phe Asp
450 455 460

Thr Lys Lys Val Ser Arg Lys Val Thr Lys Asp Gly Lys Val Gly Tyr
465 470 475 480

Met Met Pro Lys Asp Gly Lys Asp Tyr Phe Tyr Ala Arg Asp Gln Leu
485 490 495

Asp Leu Thr Gln Ile Ala Phe Ala Glu Gln Glu Leu Met Leu Lys Asp
500 505 510

Lys Lys His Tyr Arg Tyr Asp Ile Val Asp Thr Gly Ile Glu Pro Arg
515 520 525

Leu Ala Val Asp Val Ser Ser Leu Pro Met His Ala Gly Asn Ala Thr
530 535 540

Tyr Asp Thr Gly Ser Ser Phe Val Ile Pro His Ile Asp His Ile His
545 550 555 560

Val Val Pro Tyr Ser Trp Leu Thr Arg Asp Gln Ile Ala Thr Val Lys
565 570 575

Tyr Val Met Gln His Pro Glu Val Arg Pro Asp Val Trp Ser Lys Pro
580 585 590

Gly His Glu Glu Ser Gly Ser Val Ile Pro Asn Val Thr Pro Leu Asp
595 600 605

Lys Arg Ala Gly Met Pro Asn Trp Gln Ile Ile His Ser Ala Glu Glu
610 615 620

Val Gln Lys Ala Leu Ala Glu Gly Arg Phe Ala Thr Pro Asp Gly Tyr
625 630 635 640

Ile Phe Asp Pro Arg Asp Val Leu Ala Lys Glu Thr Phe Val Trp Lys
645 650 655

Asp Gly Ser Phe Ser Ile Pro Arg Ala Asp Gly Ser Ser Leu Arg Thr
660 665 670

Ile Asn Lys Ser Asp Leu Ser Gln Ala Glu Trp Gln Gln Ala Gln Glu
675 680 685

Leu Leu Ala Lys Lys Asn Thr Gly Asp Ala Thr Asp Thr Asp Lys Pro
690 695 700

Lys Glu Lys Gln Gln Ala Asp Lys Ser Asn Glu Asn Gln Gln Pro Ser
705 710 715 720

Glu Ala Ser Lys Glu Glu Lys Glu Ser Asp Asp Phe Ile Asp Ser Leu
725 730 735

Pro Asp Tyr Gly Leu Asp Arg Ala Thr Leu Glu Asp His Ile Asn Gln
740 745 750

Leu Ala Gln Lys Ala Asn Ile Asp Pro Lys Tyr Leu Ile Phe Gln Pro
755 760 765

Glu Gly Val Gln Phe Tyr Asn Lys Asn Gly Glu Leu Val Thr Tyr Asp
770 775 780

Ile Lys Thr Leu Gln Gln Ile Asn Pro
785 790



<210> 16 

<211> 715 

<212> PRT 

<213> Streptococcus 



<400> 16 
Met Thr Asp Pro Asn Tyr Arg Phe Lys Gln Ser Asp Val Ile Asn Glu
1 5 10 15

Ile Leu Asp Gly Tyr Val Ile Lys Val Asn Gly Asn Tyr Tyr Val Tyr
20 25 30

Leu Lys Pro Gly Ser Lys Arg Lys Asn Ile Arg Thr Lys Gln Gln Ile
35 40 45

Ala Glu Gln Val Ala Lys Gly Thr Lys Glu Ala Lys Glu Lys Gly Leu
50 55 60

Ala Gln Val Ala His Leu Ser Lys Glu Glu Val Ala Ala Val Asn Glu
65 70 75 80

Ala Lys Arg Gln Gly Arg Tyr Thr Thr Asp Asp Gly Tyr Ile Phe Ser
85 90 95

Pro Thr Asp Ile Ile Asp Asp Leu Gly Asp Ala Tyr Leu Val Pro His
100 105 110

Gly Asn His Tyr His Tyr Ile Pro Lys Lys Asp Leu Ser Pro Ser Glu
115 120 125

Leu Ala Ala Ala Gln Ala Tyr Trp Ser Gln Lys Gln Gly Arg Gly Ala
130 135 140

Arg Pro Ser Asp Tyr Arg Pro Thr Pro Ala Pro Gly Arg Arg Lys Ala
145 150 155 160

Pro Ile Pro Asp Val Thr Pro Asn Pro Gly Gln Gly His Gln Pro Asp
165 170 175

Asn Gly Gly Tyr His Pro Ala Pro Pro Arg Pro Asn Asp Ala Ser Gln
180 185 190

Asn Lys His Gln Arg Asp Glu Phe Lys Gly Lys Thr Phe Lys Glu Leu
195 200 205

Leu Asp Gln Leu His Arg Leu Asp Leu Lys Tyr Arg His Val Glu Glu
210 215 220

Asp Gly Leu Ile Phe Glu Pro Thr Gln Val Ile Lys Ser Asn Ala Phe
225 230 235 240

Gly Tyr Val Val Pro His Gly Asp His Tyr His Ile Ile Pro Arg Ser
245 250 255

Gln Leu Ser Pro Leu Glu Met Glu Leu Ala Asp Arg Tyr Leu Ala Gly
260 265 270

Gln Thr Glu Asp Asn Asp Ser Gly Ser Glu His Ser Lys Pro Ser Asp
275 280 285

Lys Glu Val Thr His Thr Phe Leu Gly His Arg Ile Lys Ala Tyr Gly
290 295 300

Lys Gly Leu Asp Gly Lys Pro Tyr Asp Thr Ser Asp Ala Tyr Val Phe
305 310 315 320

Ser Lys Glu Ser Ile His Ser Val Asp Lys Ser Gly Val Thr Ala Lys
325 330 335

His Gly Asp His Phe His Tyr Ile Gly Phe Gly Glu Leu Glu Gln Tyr
340 345 350

Glu Leu Asp Glu Val Ala Asn Trp Val Lys Ala Lys Gly Gln Ala Asp
355 360 365

Glu Leu Ala Ala Ala Leu Asp Gln Glu Gln Gly Lys Glu Lys Pro Leu
370 375 380

Phe Asp Thr Lys Lys Val Ser Arg Lys Val Thr Lys Asp Gly Lys Val
385 390 395 400

Gly Tyr Met Met Pro Lys Asp Gly Lys Asp Tyr Phe Tyr Ala Arg Asp
405 410 415

Gln Leu Asp Leu Thr Gln Ile Ala Phe Ala Glu Gln Glu Leu Met Leu
420 425 430

Lys Asp Lys Lys His Tyr Arg Tyr Asp Ile Val Asp Thr Gly Ile Glu
435 440 445

Pro Arg Leu Ala Val Asp Val Ser Ser Leu Pro Met His Ala Gly Asn
450 455 460

Ala Thr Tyr Asp Thr Gly Ser Ser Phe Val Ile Pro His Ile Asp His
465 470 475 480

Ile His Val Val Pro Tyr Ser Trp Leu Thr Arg Asp Gln Ile Ala Thr
485 490 495

Val Lys Tyr Val Met Gln His Pro Glu Val Arg Pro Asp Val Trp Ser
500 505 510

Lys Pro Gly His Glu Glu Ser Gly Ser Val Ile Pro Asn Val Thr Pro
515 520 525

Leu Asp Lys Arg Ala Gly Met Pro Asn Trp Gln Ile Ile His Ser Ala
530 535 540

Glu Glu Val Gln Lys Ala Leu Ala Glu Gly Arg Phe Ala Thr Pro Asp
545 550 555 560

Gly Tyr Ile Phe Asp Pro Arg Asp Val Leu Ala Lys Glu Thr Phe Val
565 570 575

Trp Lys Asp Gly Ser Phe Ser Ile Pro Arg Ala Asp Gly Ser Ser Leu
580 585 590

Arg Thr Ile Asn Lys Ser Asp Leu Ser Gln Ala Glu Trp Gln Gln Ala
595 600 605

Gln Glu Leu Leu Ala Lys Lys Asn Thr Gly Asp Ala Thr Asp Thr Asp
610 615 620

Lys Pro Lys Glu Lys Gln Gln Ala Asp Lys Ser Asn Glu Asn Gln Gln
625 630 635 640

Pro Ser Glu Ala Ser Lys Glu Glu Lys Glu Ser Asp Asp Phe Ile Asp
645 650 655

Ser Leu Pro Asp Tyr Gly Leu Asp Arg Ala Thr Leu Glu Asp His Ile
660 665 670

Asn Gln Leu Ala Gln Lys Ala Asn Ile Asp Pro Lys Tyr Leu Ile Phe
675 680 685

Gln Pro Glu Gly Val Gln Phe Tyr Asn Lys Asn Gly Glu Leu Val Thr
690 695 700

Tyr Asp Ile Lys Thr Leu Gln Gln Ile Asn Pro
705 710 715



<210> 17 
<211> 77 
<212> PRT 

<213> Streptococcus 
<400> 17 

Met His Ser Phe Ser Asn Pro Gly Tyr Pro Tyr Asp Asn Ala Val Thr
1 5 10 15

Glu Ala Phe Phe Lys Tyr Leu Lys His Arg Gln Ile Asn Arg Lys His
20 25 30

Tyr Gln Asn Ile Lys Gln Val Gln Leu Asp Cys Phe Glu Tyr Ile Glu
35 40 45

Asn Phe Tyr Asn Asn Tyr Asn Pro His Thr Ala Asn Leu Gly Leu Thr
50 55 60

Pro Asn Gln Lys Glu Glu Asn Tyr Phe Asn Ala Ile Lys
65 70 75



<210> 18 
<211> 86 
<212> PRT 

<213> Streptococcus
<400> 18

Met Ala Tyr Tyr Gln Ala Cys Thr Glu Lys Asp Ile Ile Arg Ser Met
1 5 10 15

Ser Arg Lys Gly Thr Pro Ala Asp Asn Ala Cys Ile Glu Trp Phe His
20 25 30

Thr Val Leu Lys Thr Glu Thr Phe Tyr Phe His Asn Arg Arg Lys Tyr
35 40 45

Asn Lys Asp Ser Ile Thr Asn Ile Val Lys Asn Tyr Ile Thr Phe Tyr
50 55 60

Asn Glu Thr Arg Ile Gln Gln Arg Leu Asn Asp Gln Ser Pro Val Gln
65 70 75 80

Tyr Arg Lys Leu Ile Ala
85























<210> 19 
<211> 126 
<212> PRT 

<213> Streptococcus 



<400> 19 
Met Glu Asn His Phe Ile Tyr Gly Tyr Arg Thr Ile Thr Arg Leu Leu
1 5 10 15

Lys Lys Ile His Gly Leu Thr Val Asn Thr Lys Lys Val Tyr Arg Ile
20 25 30

Met Lys Asn Asn Gly Trp Leu Cys Arg Thr Arg Thr Lys Lys Val Pro
35 40 45

Asn Leu Gly Lys Ala Tyr Tyr Leu Thr Asp Asn Lys Leu Ser Arg Asp
50 55 60

Phe His Ala Asp Lys Pro Lys Glu Lys Leu Val Thr Asp Ile Thr Tyr
65 70 75 80

Leu Tyr Phe Gly Asn Cys Lys Leu Tyr Leu Ser Ser Ile Met Asn Leu
85 90 95

Tyr Asn Arg Glu Ile Ile Ala Tyr Thr Ile Ser Asp Cys Gln Asp Thr
100 105 110

Asp Phe Val Leu Asp Thr Leu Asn Gln Leu Lys Leu Pro Lys
115 120 125









<210> 20 
<211> 96 
<212> PRT 

<213> Streptococcus 
<400> 20

Met Val Lys Lys Ala Tyr Ser Trp Glu Thr Lys Leu Ala Cys Ile Asp
1 5 10 15

Met Lys Lys Ala Gly Lys Ser Asn Arg Val Ile Met Glu Thr Leu Gly
20 25 30

Ile Lys Asn Asn Ser Gln Ile Tyr Thr Trp Met Lys Trp Tyr Glu Asn
35 40 45

Glu Glu Leu Tyr Arg Phe His Gln Gly Val Gly Lys Gln Tyr Thr Tyr
50 55 60

Gly Lys Gly Leu Glu His Leu Ser Glu Val Glu Gln Leu Gln Leu Gln
65 70 75 80

Val Asp Leu Leu Lys Lys Tyr Arg Gly Leu Ile Arg Lys Ser Ile Lys
85 90 95



<210> 21 

<211> 288 

<212> PRT 

<213> Streptococcus



<400> 21 



Ile Arg Tyr Pro Lys Ala Ser Ser Gly Asp Tyr Gly Thr Lys Arg Glu
1 5 10 15

Ile Ile Thr Ala Asn Lys Asp Lys Tyr Ser Ile Ser Lys Met Cys Arg
20 25 30

Trp Leu Asn Met Pro His Ser Ser Tyr Tyr Tyr Gln Ala Val Glu Ser
35 40 45

Val Ser Glu Thr Glu Phe Glu Glu Thr Ile Lys Arg Ile Phe Leu Asp
50 55 60

Ser Glu Ser Arg Tyr Gly Ser Arg Lys Ile Lys Ile Cys Leu Asn Asn
65 70 75 80

Glu Gly Ile Thr Leu Ser Arg Arg Arg Ile Arg Arg Ile Met Lys Arg
85 90 95

Leu Asn Leu Val Ser Val Tyr Gln Lys Ala Thr Phe Lys Pro His Ser
100 105 110

Arg Gly Lys Asn Glu Ala Pro Ile Pro Asn His Leu Asp Arg Gln Phe
115 120 125

Lys Gln Glu Arg Pro Leu Gln Ala Leu Val Thr Asp Leu Thr Tyr Val
130 135 140

Arg Val Gly Asn Arg Trp Ala Tyr Val Cys Leu Ile Ile Asp Leu Tyr
145 150 155 160

Asn Arg Glu Ile Ile Gly Leu Ser Leu Gly Trp His Lys Thr Ala Glu
165 170 175

Leu Val Lys Gln Ala Ile Gln Ser Ile Pro Tyr Ala Leu Thr Lys Val
180 185 190

Lys Met Phe His Ser Asp Arg Gly Lys Glu Phe Asp Asn Gln Leu Ile
195 200 205

Asp Glu Ile Leu Glu Ala Phe Gly Ile Thr Arg Ser Leu Ser Gln Ala
210 215 220

Gly Tyr Pro Tyr Asp Asn Ala Val Ala Glu Ser Thr Tyr Arg Ala Phe
225 230 235 240

Lys Ile Glu Phe Val Tyr Gln Glu Thr Phe Gln Leu Leu Glu Glu Leu
245 250 255

Ala Leu Lys Thr Lys Asp Tyr Val His Trp Trp Asn Tyr His Arg Ile
260 265 270

His Gly Ser Leu Asn Tyr Gln Thr Pro Met Thr Lys Arg Leu Ile Ala
275 280 285
<210> 22

<211> 5058 

<212> DNA 

<213> streptococcus 

<220>

<221> CDS

<222> (1)...(663)

<221> CDS

<222> (763)...(1344)

<221> CDS

<222> (1362)...(1739)

<221> CDS

<222> (2266)...(5058)



<400> 22 

aat ttg aaa gca gaa tta tct gta gaa gat gag caa tat aca gca aca 48
Asn Leu Lys Ala Glu Leu Ser Val Glu Asp Glu Gln Tyr Thr Ala Thr
1 5 10 15

gtt tat ggt aaa tct gct cat ggt tca aca cca caa gaa ggt gtt aat 96
Val Tyr Gly Lys Ser Ala His Gly Ser Thr Pro Gln Glu Gly Val Asn
20 25 30

ggg gcg act tat tta gct ctt tat cta agt caa ttt gat ttt gaa ggt 144
Gly Ala Thr Tyr Leu Ala Leu Tyr Leu Ser Gln Phe Asp Phe Glu Gly
35 40 45

cct gct cgt gct ttc tta gat gtt aca gcc aac att att cac gaa gac 192
Pro Ala Arg Ala Phe Leu Asp Val Thr Ala Asn Ile Ile His Glu Asp
50 55 60

ttc tca ggt gaa aaa ctt gga gta gct tat gaa gat gac tgt atg gga 240
Phe Ser Gly Glu Lys Leu Gly Val Ala Tyr Glu Asp Asp Cys Met Gly
65 70 75 80

cca ttg agc atg aat gca ggt gtc ttc cag ttt gat gaa act aat gat 288
Pro Leu Ser Met Asn Ala Gly Val Phe Gln Phe Asp Glu Thr Asn Asp
85 90 95

gat aat act atc gct ctt aat ttc cgt tac cca caa ggg aca gat gct 336
Asp Asn Thr Ile Ala Leu Asn Phe Arg Tyr Pro Gln Gly Thr Asp Ala
100 105 110

aaa act atc caa act aag ctt gag aaa ctt aac gga gtt gaa aaa gtg 384
Lys Thr Ile Gln Thr Lys Leu Glu Lys Leu Asn Gly Val Glu Lys Val
115 120 125

act ctt tct gac cat gaa cac aca cca cac tat gta cct atg gac gat 432
Thr Leu Ser Asp His Glu His Thr Pro His Tyr Val Pro Met Asp Asp
130 135 140
gaa tta gta tca acc tta cta gct gtc tat gaa aag caa act ggt ctt 480
Glu Leu Val Ser Thr Leu Leu Ala Val Tyr Glu Lys Gln Thr Gly Leu
145 150 155 160

aaa gga cat gaa cag gtt att ggt ggt ggg aca ttt ggt cgc tta ctt 528
Lys Gly His Glu Gln Val Ile Gly Gly Gly Thr Phe Gly Arg Leu Leu
165 170 175

gaa cgg ggt gtt gca tac ggt gcc atg ttc cca gga gat gaa aac act 576
Glu Arg Gly Val Ala Tyr Gly Ala Met Phe Pro Gly Asp Glu Asn Thr
180 185 190

atg cat caa gct aat gag tac atg cct tta gaa aat att ttc cgt tcg 624
Met His Gln Ala Asn Glu Tyr Met Pro Leu Glu Asn Ile Phe Arg Ser
195 200 205

gct gct atc tac gca gaa gct atc tat gaa tta atc aaa taaaataatc 673
Ala Ala Ile Tyr Ala Glu Ala Ile Tyr Glu Leu Ile Lys
210 215 220

cttaaactaa atatgtgatc aatgataaag ggtggtgaag acatgaaagt gtctttgcct 733

cttttcataa ggttagattt ggagacttt atg act gac ttg gaa aaa att att 786
Met Thr Asp Leu Glu Lys Ile Ile
225

aaa gca ata aaa agt gat tca cag aat caa aat tat aca gaa aat ggt 834
Lys Ala Ile Lys Ser Asp Ser Gln Asn Gln Asn Tyr Thr Glu Asn Gly
230 235 240 245

att gat cct ttg ttt gct gct cct aaa aca gct agg atc aat att gtt 882
Ile Asp Pro Leu Phe Ala Ala Pro Lys Thr Ala Arg Ile Asn Ile Val
250 255 260

ggc caa gca cct ggt tta aaa act caa gaa gca aga ctc tat tgg aaa 930
Gly Gln Ala Pro Gly Leu Lys Thr Gln Glu Ala Arg Leu Tyr Trp Lys
265 270 275

gat aaa tct gga gat cgt cta cgc cag tgg ctt gga gtt gat gaa gag 978
Asp Lys Ser Gly Asp Arg Leu Arg Gln Trp Leu Gly Val Asp Glu Glu
280 285 290

aca ttt tac cat tct gga aaa ttt gct gtt tta cct tta gat ttt tat 1026
Thr Phe Tyr His Ser Gly Lys Phe Ala Val Leu Pro Leu Asp Phe Tyr
295 300 305

tac cca ggc aaa gga aaa tca gga gat tta ccc cct aga aaa ggt ttt 1074
Tyr Pro Gly Lys Gly Lys Ser Gly Asp Leu Pro Pro Arg Lys Gly Phe
310 315 320 325

gcg gag aaa tgg cac cct ctt att tta aaa gaa atg cct aat gtt caa 1122
Ala Glu Lys Trp His Pro Leu Ile Leu Lys Glu Met Pro Asn Val Gln
330 335 340

ttg acc ttg cta gtt ggt cag tat gct cag aaa tat tat ctt gga agc 1170
Leu Thr Leu Leu Val Gly Gln Tyr Ala Gln Lys Tyr Tyr Leu Gly Ser
345 350 355
tcc gca cat aaa aat cta aca gaa aca gtt aaa gct tac aaa gac tat 1218
Ser Ala His Lys Asn Leu Thr Glu Thr Val Lys Ala Tyr Lys Asp Tyr
360 365 370

cta ccc gat tat tta ccc ctg gtt cac cca tca ccg cga aat caa att 1266
Leu Pro Asp Tyr Leu Pro Leu Val His Pro Ser Pro Arg Asn Gln Ile
375 380 385

tgg cta aag aag aat cca tgg ttt gaa aaa gat cta atc gtt gat tta 1314
Trp Leu Lys Lys Asn Pro Trp Phe Glu Lys Asp Leu Ile Val Asp Leu
390 395 400 405

caa aag ata gta gca gat att tta aaa gat taaggatagg agttggt atg 1364
Gln Lys Ile Val Ala Asp Ile Leu Lys Asp Met
410 415

aga gat aat cat cta cac acg tat ttt tcc tat gat tgt caa acg gca 1412
Arg Asp Asn His Leu His Thr Tyr Phe Ser Tyr Asp Cys Gln Thr Ala
420 425 430

ttt gag gac tat att aat ggt ttt aca ggt gaa ttt atc acg aca gaa 1460
Phe Glu Asp Tyr Ile Asn Gly Phe Thr Gly Glu Phe Ile Thr Thr Glu
435 440 445

cat ttt gat tta tca aat cct tac acc ggt caa gac gat gtt cct gat 1508
His Phe Asp Leu Ser Asn Pro Tyr Thr Gly Gln Asp Asp Val Pro Asp
450 455 460

tat agt gct tat tgt caa aaa ata gat tat ctt aat cag aaa tat gga 1556
Tyr Ser Ala Tyr Cys Gln Lys Ile Asp Tyr Leu Asn Gln Lys Tyr Gly
465 470 475 480

aat cga ttt aaa aaa gga att gaa atc ggt tat ttt aaa gat agg gaa 1604
Asn Arg Phe Lys Lys Gly Ile Glu Ile Gly Tyr Phe Lys Asp Arg Glu
485 490 495

tca gat att tta gat tat tta aaa aat aaa gaa ttt gat tta aaa cta 1652
Ser Asp Ile Leu Asp Tyr Leu Lys Asn Lys Glu Phe Asp Leu Lys Leu
500 505 510

ttg tca atc cat cat aat ggt agg tat gat tat ctg caa gaa gaa gct 1700
Leu Ser Ile His His Asn Gly Arg Tyr Asp Tyr Leu Gln Glu Glu Ala
515 520 525

ctg aaa gta cca aca aag gga gct ttt agc aga tta ctt taatcgtatg 1749
Leu Lys Val Pro Thr Lys Gly Ala Phe Ser Arg Leu Leu
530 535 540

gaatttgcca taggccgtgt ggaagcgcac gttttagctc actttgatta tggttttcgt 1809

aagttaaact tagatgtaga agatttaaaa ccgtttgaaa cgcaattgaa gcgcattttc 1869

ataaagatgt tatctaaggg gttagctttt gaactaaata ccaaatccct ttatctatat 1929

gggaatgaaa aactttatcg ctatgcttta gagatactca aacagcttgg ttgtaaacaa 1989

tactctatag gctctgacgg tcatattcct gaacattttt gttatgaatt tgatagactt 2049

caaggtctgc taaaggacta tcaaattgat gaaaatcatt tgatatgagg aaatttttga 2109

taaaaaagct aggcaatatt gcttagcttt tttgtaatgc tattgatagt tttagtgaaa 2169
atttcaaaaa aataaagaaa tcatttactt gttgcaagcg cttgcgtaaa ttgttatgat 2229

tttattggta acaattcatt aaaaaaggag aatgat atg aaa aga aaa gac tta 2283
Met Lys Arg Lys Asp Leu
545

ttt ggt gat aaa caa act caa tac acg att aga aag tta agt gtt gga 2331
Phe Gly Asp Lys Gln Thr Gln Tyr Thr Ile Arg Lys Leu Ser Val Gly
550 555 560

gta gct tca gtt aca aca ggg gta tgt att ttt ctt cat agt cca cag 2379
Val Ala Ser Val Thr Thr Gly Val Cys Ile Phe Leu His Ser Pro Gln
565 570 575

gta ttt gct gaa gaa gta agt gtt tct cct gca act aca gcg att gca 2427
Val Phe Ala Glu Glu Val Ser Val Ser Pro Ala Thr Thr Ala Ile Ala
580 585 590 595

gag tcg aat att aat cag gtt gac aac caa caa tct act aat tta aaa 2475
Glu Ser Asn Ile Asn Gln Val Asp Asn Gln Gln Ser Thr Asn Leu Lys
600 605 610

gat gac ata aac tca aac tct gag acg gtt gtg aca ccc tca gat atg 2523
Asp Asp Ile Asn Ser Asn Ser Glu Thr Val Val Thr Pro Ser Asp Met
615 620 625

ccg gat acc aag caa tta gta tca gat gaa act gac act caa aag gga 2571
Pro Asp Thr Lys Gln Leu Val Ser Asp Glu Thr Asp Thr Gln Lys Gly
630 635 640

gtg aca gag ccg gat aag gcg aca agc ctg ctt gaa gaa aat aaa ggt 2619
Val Thr Glu Pro Asp Lys Ala Thr Ser Leu Leu Glu Glu Asn Lys Gly
645 650 655

cct gtt tca gat aaa aat acc tta gat tta aaa gta gca cca tct aca 2667
Pro Val Ser Asp Lys Asn Thr Leu Asp Leu Lys Val Ala Pro Ser Thr
660 665 670 675

ttg caa aat act ccc gac aaa act tct caa gct ata ggt gct cca agc 2715
Leu Gln Asn Thr Pro Asp Lys Thr Ser Gln Ala Ile Gly Ala Pro Ser
680 685 690

cct acc ttg aaa gta gct aat caa gct cca cgg att gaa aat ggt tac 2763
Pro Thr Leu Lys Val Ala Asn Gln Ala Pro Arg Ile Glu Asn Gly Tyr
695 700 705

ttt agg cta cat ctt aaa gaa ttg cct caa ggt cat cct gta gaa agc 2811
Phe Arg Leu His Leu Lys Glu Leu Pro Gln Gly His Pro Val Glu Ser
710 715 720

act gga ctt tgg ata tgg gga gat gtt gat caa ccg tct agt aat tgg 2859
Thr Gly Leu Trp Ile Trp Gly Asp Val Asp Gln Pro Ser Ser Asn Trp
725 730 735

cca aat ggt gct atc cct atg act gat gct aag aaa gat gat tac ggt 2907
Pro Asn Gly Ala Ile Pro Met Thr Asp Ala Lys Lys Asp Asp Tyr Gly
740 745 750 755
tat tat gtt gat ttt aaa tta tct gaa aaa caa cga aaa caa ata tct 2955
Tyr Tyr Val Asp Phe Lys Leu Ser Glu Lys Gln Arg Lys Gln Ile Ser
760 765 770

ttt tta att aat aac aaa gca ggg aca aat tta agc ggc gat cat cat 3003
Phe Leu Ile Asn Asn Lys Ala Gly Thr Asn Leu Ser Gly Asp His His
775 780 785

att cca tta tta cga cct gag atg aac caa gtt tgg att gat gaa aag 3051
Ile Pro Leu Leu Arg Pro Glu Met Asn Gln Val Trp Ile Asp Glu Lys
790 795 800

tac ggt ata cat act tat caa ccc ctc aaa gaa ggg tat gtc cgt att 3099
Tyr Gly Ile His Thr Tyr Gln Pro Leu Lys Glu Gly Tyr Val Arg Ile
805 810 815

aac tat ttg agt tcc tct agt aac tat gac cac tta tca gca tgg ctc 3147
Asn Tyr Leu Ser Ser Ser Ser Asn Tyr Asp His Leu Ser Ala Trp Leu
820 825 830 835

ttt aaa gat gtt gca acc ccy tca aca act tgg cca gat ggt agt aat 3195
Phe Lys Asp Val Ala Thr Xaa Ser Thr Thr Trp Pro Asp Gly Ser Asn
840 845 850

ttt gtg aat caa gga cta tat gga agg tat att gat gta tca cta aaa 3243
Phe Val Asn Gln Gly Leu Tyr Gly Arg Tyr Ile Asp Val Ser Leu Lys
855 860 865

act aac gcc aaa gag att ggt ttt cta atc tta gat gaa agt aag aca 3291
Thr Asn Ala Lys Glu Ile Gly Phe Leu Ile Leu Asp Glu Ser Lys Thr
870 875 880

gga gat gca gtg aaa gtt caa ccc aac gac tat gtt ttt aga gat tta 3339
Gly Asp Ala Val Lys Val Gln Pro Asn Asp Tyr Val Phe Arg Asp Leu
885 890 895

gct aac cat aac caa att ttt gta aaa gat aag gat cca aag gtt tat 3387
Ala Asn His Asn Gln Ile Phe Val Lys Asp Lys Asp Pro Lys Val Tyr
900 905 910 915

aat aat cct tat tac att gat caa gtg cag cta aag gat gcc caa caa 3435
Asn Asn Pro Tyr Tyr Ile Asp Gln Val Gln Leu Lys Asp Ala Gln Gln
920 925 930

att gat tta aca agt att caa gca agt ttt aca act cta gat ggg gta 3483
Ile Asp Leu Thr Ser Ile Gln Ala Ser Phe Thr Thr Leu Asp Gly Val
935 940 945

gat aaa act gaa att tta aaa gaa ttg aaa gtg act gat aaa aat caa 3531
Asp Lys Thr Glu Ile Leu Lys Glu Leu Lys Val Thr Asp Lys Asn Gln
950 955 960

aat gct ata caa att tct gat atc act ctc gat act agt aaa tct ctt 3579
Asn Ala Ile Gln Ile Ser Asp Ile Thr Leu Asp Thr Ser Lys Ser Leu
965 970 975
tta ata atc aaa ggc gac ttt aat cct aaa caa ggt cat ttc aac ata 3627
Leu Ile Ile Lys Gly Asp Phe Asn Pro Lys Gln Gly His Phe Asn Ile
980 985 990 995

tct tat aat ggt aac aat gtc atg aca agg caa tct tgg gaa ttt aaa 3675
Ser Tyr Asn Gly Asn Asn Val Met Thr Arg Gln Ser Trp Glu Phe Lys
1000 1005 1010

gac caa ctt tat gct tat agt gga aat tta ggt gca gtt ctc aat caa 3723
Asp Gln Leu Tyr Ala Tyr Ser Gly Asn Leu Gly Ala Val Leu Asn Gln
1015 1020 1025

gat ggt tca aaa gtt gaa gcc agc ctc tgg tca ccg agt gct gat agt 3771
Asp Gly Ser Lys Val Glu Ala Ser Leu Trp Ser Pro Ser Ala Asp Ser
1030 1035 1040

gtc act atg att att tat gac aaa gat aac caa aac agg gtt gta gcg 3819
Val Thr Met Ile Ile Tyr Asp Lys Asp Asn Gln Asn Arg Val Val Ala
1045 1050 1055

act acc ccc ctt gtg aaa aat aat aaa ggt gtt tgg cag acg ata ctt 3867
Thr Thr Pro Leu Val Lys Asn Asn Lys Gly Val Trp Gln Thr Ile Leu
1060 1065 1070 1075

gat act aaa tta ggt att aaa aac tat act ggt tac tat tat ctt tac 3915
Asp Thr Lys Leu Gly Ile Lys Asn Tyr Thr Gly Tyr Tyr Tyr Leu Tyr
1080 1085 1090

gaa ata aaa aga ggt aag gat aag gtt aag att tta gat cct tat gca 3963
Glu Ile Lys Arg Gly Lys Asp Lys Val Lys Ile Leu Asp Pro Tyr Ala
1095 1100 1105

aag tca tta gca gag tgg gat agt aat act gtt aat gat gat att aaa 4011
Lys Ser Leu Ala Glu Trp Asp Ser Asn Thr Val Asn Asp Asp Ile Lys
1110 1115 1120

acg gct aaa gca gct ttt gta aat cca agt caa ctt gga cct caa aat 4059
Thr Ala Lys Ala Ala Phe Val Asn Pro Ser Gln Leu Gly Pro Gln Asn
1125 1130 1135

tta agt ttt gct aaa att gct aat ttt aaa gga aga caa gat gct gtt 4107
Leu Ser Phe Ala Lys Ile Ala Asn Phe Lys Gly Arg Gln Asp Ala Val
1140 1145 1150 1155

ata tac gaa gca cat gta aga gac ttc act tct gat cga tct ttg gat 4155
Ile Tyr Glu Ala His Val Arg Asp Phe Thr Ser Asp Arg Ser Leu Asp
1160 1165 1170

gga aaa tta aaa aat caa ttt ggt acc ttt gca gcc ttt tca gag aaa 4203
Gly Lys Leu Lys Asn Gln Phe Gly Thr Phe Ala Ala Phe Ser Glu Lys
1175 1180 1185

cta gat tat tta cag aaa tta gga gtt aca cac att cag ctt tta ccg 4251
Leu Asp Tyr Leu Gln Lys Leu Gly Val Thr His Ile Gln Leu Leu Pro
1190 1195 1200
gta ttg agt tat ttt tat gtt aat gaa atg gat aag tca cgc tca aca 4299
Val Leu Ser Tyr Phe Tyr Val Asn Glu Met Asp Lys Ser Arg Ser Thr
1205 1210 1215

gct tac act tcc tca gac aat aat tac aat tgg ggc tat gac cca cag 4347
Ala Tyr Thr Ser Ser Asp Asn Asn Tyr Asn Trp Gly Tyr Asp Pro Gln
1220 1225 1230 1235

agc tat ttt gct ctt tct ggg atg tat tca gag aaa cca aaa gat cca 4395
Ser Tyr Phe Ala Leu Ser Gly Met Tyr Ser Glu Lys Pro Lys Asp Pro
1240 1245 1250

tca gca cgt atc gcc gaa tta aaa caa tta ata cat gat att cat aaa 4443
Ser Ala Arg Ile Ala Glu Leu Lys Gln Leu Ile His Asp Ile His Lys
1255 1260 1265

cgt ggc atg ggg gtt ata ctt gat gtc gtc tat aat cac act gca aaa 4491
Arg Gly Met Gly Val Ile Leu Asp Val Val Tyr Asn His Thr Ala Lys
1270 1275 1280

act tat ctc ttt gag gat ata gaa cct aat tat tat cac ttt atg aat 4539
Thr Tyr Leu Phe Glu Asp Ile Glu Pro Asn Tyr Tyr His Phe Met Asn
1285 1290 1295

gaa gat ggt tca cca aga gaa agt ttt gga ggg gga cgt tta gga acc 4587
Glu Asp Gly Ser Pro Arg Glu Ser Phe Gly Gly Gly Arg Leu Gly Thr
1300 1305 1310 1315

act cat gca atg agt cgt cgt gtt ttg gtt gat tcc att aaa tat ctt 4635
Thr His Ala Met Ser Arg Arg Val Leu Val Asp Ser Ile Lys Tyr Leu
1320 1325 1330

aca agt gaa ttt aaa gtt gat ggt ttc cgt ttt gat atg atg gga gat 4683
Thr Ser Glu Phe Lys Val Asp Gly Phe Arg Phe Asp Met Met Gly Asp
1335 1340 1345

cat gat gcg gct gcg att gaa tta gct tat aaa gaa gct aaa gct att 4731
His Asp Ala Ala Ala Ile Glu Leu Ala Tyr Lys Glu Ala Lys Ala Ile
1350 1355 1360

aat cct aat atg att atg att ggt gag ggc tgg aga aca ttc caa ggc 4779
Asn Pro Asn Met Ile Met Ile Gly Glu Gly Trp Arg Thr Phe Gln Gly
1365 1370 1375

gat caa ggt cag ccg gtt aaa cca gct gac caa gat tgg atg aag tca 4827
Asp Gln Gly Gln Pro Val Lys Pro Ala Asp Gln Asp Trp Met Lys Ser
1380 1385 1390 1395

acc gat aca gtt ggc gtc ttt tca gat gat att cgt aat agc ttg aaa 4875
Thr Asp Thr Val Gly Val Phe Ser Asp Asp Ile Arg Asn Ser Leu Lys
1400 1405 1410

tct ggt ttt cca aat gaa ggt act cca gct ttc atc aca ggt ggc cca 4923
Ser Gly Phe Pro Asn Glu Gly Thr Pro Ala Phe Ile Thr Gly Gly Pro
1415 1420 1425
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caa tct tta caa ggt att ttt aaa aat ate aaa gca caa cct ggg aat 4971 

Gin Ser Leu Gin Gly lie Phe Lys Asn lie Lys Ala Gin Pro Gly Asn 

1430 1435 1440 

ttt gaa gca gat teg cca gga gat gtg gtg cag tat att get gca cat 5019 

Phe Glu Ala Asp Ser Pro Gly Asp Val Val Gin Tyr lie Ala Ala His 

1445 1450 1455 

gat aac ctt ace ttg cat gat gtg att gca aaa tea att 5058 

Asp Asn Leu Thr Leu His Asp Val lie Ala Lys Ser lie 

1460 1465 1470 



<210> 23
<211> 221
<212> PRT
<213> streptococcus

<400> 23

Asn Leu Lys Ala Glu Leu Ser Val Glu Asp Glu Gln Tyr Thr Ala Thr
1 5 10 15

Val Tyr Gly Lys Ser Ala His Gly Ser Thr Pro Gln Glu Gly Val Asn
20 25 30

Gly Ala Thr Tyr Leu Ala Leu Tyr Leu Ser Gln Phe Asp Phe Glu Gly
35 40 45

Pro Ala Arg Ala Phe Leu Asp Val Thr Ala Asn Ile Ile His Glu Asp
50 55 60

Phe Ser Gly Glu Lys Leu Gly Val Ala Tyr Glu Asp Asp Cys Met Gly
65 70 75 80

Pro Leu Ser Met Asn Ala Gly Val Phe Gln Phe Asp Glu Thr Asn Asp
85 90 95

Asp Asn Thr Ile Ala Leu Asn Phe Arg Tyr Pro Gln Gly Thr Asp Ala
100 105 110

Lys Thr Ile Gln Thr Lys Leu Glu Lys Leu Asn Gly Val Glu Lys Val
115 120 125

Thr Leu Ser Asp His Glu His Thr Pro His Tyr Val Pro Met Asp Asp
130 135 140

Glu Leu Val Ser Thr Leu Leu Ala Val Tyr Glu Lys Gln Thr Gly Leu
145 150 155 160

Lys Gly His Glu Gln Val Ile Gly Gly Gly Thr Phe Gly Arg Leu Leu
165 170 175

Glu Arg Gly Val Ala Tyr Gly Ala Met Phe Pro Gly Asp Glu Asn Thr
180 185 190

Met His Gln Ala Asn Glu Tyr Met Pro Leu Glu Asn Ile Phe Arg Ser
195 200 205

Ala Ala Ile Tyr Ala Glu Ala Ile Tyr Glu Leu Ile Lys
210 215 220

<210> 24
<211> 194
<212> PRT
<213> streptococcus

<400> 24
Met Thr Asp Leu Glu Lys Ile Ile Lys Ala Ile Lys Ser Asp Ser Gln
1 5 10 15

Asn Gln Asn Tyr Thr Glu Asn Gly Ile Asp Pro Leu Phe Ala Ala Pro
20 25 30

Lys Thr Ala Arg Ile Asn Ile Val Gly Gln Ala Pro Gly Leu Lys Thr
35 40 45

Gln Glu Ala Arg Leu Tyr Trp Lys Asp Lys Ser Gly Asp Arg Leu Arg
50 55 60

Gln Trp Leu Gly Val Asp Glu Glu Thr Phe Tyr His Ser Gly Lys Phe
65 70 75 80

Ala Val Leu Pro Leu Asp Phe Tyr Tyr Pro Gly Lys Gly Lys Ser Gly
85 90 95

Asp Leu Pro Pro Arg Lys Gly Phe Ala Glu Lys Trp His Pro Leu Ile
100 105 110

Leu Lys Glu Met Pro Asn Val Gln Leu Thr Leu Leu Val Gly Gln Tyr
115 120 125

Ala Gln Lys Tyr Tyr Leu Gly Ser Ser Ala His Lys Asn Leu Thr Glu
130 135 140

Thr Val Lys Ala Tyr Lys Asp Tyr Leu Pro Asp Tyr Leu Pro Leu Val
145 150 155 160

His Pro Ser Pro Arg Asn Gln Ile Trp Leu Lys Lys Asn Pro Trp Phe
165 170 175

Glu Lys Asp Leu Ile Val Asp Leu Gln Lys Ile Val Ala Asp Ile Leu
180 185 190

Lys Asp

<210> 25
<211> 126
<212> PRT
<213> streptococcus

<400> 25

Met Arg Asp Asn His Leu His Thr Tyr Phe Ser Tyr Asp Cys Gln Thr
1 5 10 15

Ala Phe Glu Asp Tyr Ile Asn Gly Phe Thr Gly Glu Phe Ile Thr Thr
20 25 30

Glu His Phe Asp Leu Ser Asn Pro Tyr Thr Gly Gln Asp Asp Val Pro
35 40 45

Asp Tyr Ser Ala Tyr Cys Gln Lys Ile Asp Tyr Leu Asn Gln Lys Tyr
50 55 60

Gly Asn Arg Phe Lys Lys Gly Ile Glu Ile Gly Tyr Phe Lys Asp Arg
65 70 75 80

Glu Ser Asp Ile Leu Asp Tyr Leu Lys Asn Lys Glu Phe Asp Leu Lys
85 90 95

Leu Leu Ser Ile His His Asn Gly Arg Tyr Asp Tyr Leu Gln Glu Glu
100 105 110

Ala Leu Lys Val Pro Thr Lys Gly Ala Phe Ser Arg Leu Leu
115 120 125

<210> 26
<211> 931
<212> PRT
<213> streptococcus

<400> 26

Met Lys Arg Lys Asp Leu Phe Gly Asp Lys Gln Thr Gln Tyr Thr Ile
1 5 10 15

Arg Lys Leu Ser Val Gly Val Ala Ser Val Thr Thr Gly Val Cys Ile
20 25 30

Phe Leu His Ser Pro Gln Val Phe Ala Glu Glu Val Ser Val Ser Pro
35 40 45

Ala Thr Thr Ala Ile Ala Glu Ser Asn Ile Asn Gln Val Asp Asn Gln
50 55 60

Gln Ser Thr Asn Leu Lys Asp Asp Ile Asn Ser Asn Ser Glu Thr Val
65 70 75 80

Val Thr Pro Ser Asp Met Pro Asp Thr Lys Gln Leu Val Ser Asp Glu
85 90 95

Thr Asp Thr Gln Lys Gly Val Thr Glu Pro Asp Lys Ala Thr Ser Leu
100 105 110

Leu Glu Glu Asn Lys Gly Pro Val Ser Asp Lys Asn Thr Leu Asp Leu
115 120 125

Lys Val Ala Pro Ser Thr Leu Gln Asn Thr Pro Asp Lys Thr Ser Gln
130 135 140

Ala Ile Gly Ala Pro Ser Pro Thr Leu Lys Val Ala Asn Gln Ala Pro
145 150 155 160

Arg Ile Glu Asn Gly Tyr Phe Arg Leu His Leu Lys Glu Leu Pro Gln
165 170 175

Gly His Pro Val Glu Ser Thr Gly Leu Trp Ile Trp Gly Asp Val Asp
180 185 190

Gln Pro Ser Ser Asn Trp Pro Asn Gly Ala Ile Pro Met Thr Asp Ala
195 200 205

Lys Lys Asp Asp Tyr Gly Tyr Tyr Val Asp Phe Lys Leu Ser Glu Lys
210 215 220

Gln Arg Lys Gln Ile Ser Phe Leu Ile Asn Asn Lys Ala Gly Thr Asn
225 230 235 240

Leu Ser Gly Asp His His Ile Pro Leu Leu Arg Pro Glu Met Asn Gln
245 250 255

Val Trp Ile Asp Glu Lys Tyr Gly Ile His Thr Tyr Gln Pro Leu Lys
260 265 270

Glu Gly Tyr Val Arg Ile Asn Tyr Leu Ser Ser Ser Ser Asn Tyr Asp
275 280 285

His Leu Ser Ala Trp Leu Phe Lys Asp Val Ala Thr Xaa Ser Thr Thr
290 295 300

Trp Pro Asp Gly Ser Asn Phe Val Asn Gln Gly Leu Tyr Gly Arg Tyr
305 310 315 320

Ile Asp Val Ser Leu Lys Thr Asn Ala Lys Glu Ile Gly Phe Leu Ile
325 330 335

Leu Asp Glu Ser Lys Thr Gly Asp Ala Val Lys Val Gln Pro Asn Asp
340 345 350

Tyr Val Phe Arg Asp Leu Ala Asn His Asn Gln Ile Phe Val Lys Asp
355 360 365

Lys Asp Pro Lys Val Tyr Asn Asn Pro Tyr Tyr Ile Asp Gln Val Gln
370 375 380

Leu Lys Asp Ala Gln Gln Ile Asp Leu Thr Ser Ile Gln Ala Ser Phe
385 390 395 400

Thr Thr Leu Asp Gly Val Asp Lys Thr Glu Ile Leu Lys Glu Leu Lys
405 410 415

Val Thr Asp Lys Asn Gln Asn Ala Ile Gln Ile Ser Asp Ile Thr Leu
420 425 430

Asp Thr Ser Lys Ser Leu Leu Ile Ile Lys Gly Asp Phe Asn Pro Lys
435 440 445

Gln Gly His Phe Asn Ile Ser Tyr Asn Gly Asn Asn Val Met Thr Arg
450 455 460

Gln Ser Trp Glu Phe Lys Asp Gln Leu Tyr Ala Tyr Ser Gly Asn Leu
465 470 475 480

Gly Ala Val Leu Asn Gln Asp Gly Ser Lys Val Glu Ala Ser Leu Trp
485 490 495

Ser Pro Ser Ala Asp Ser Val Thr Met Ile Ile Tyr Asp Lys Asp Asn
500 505 510

Gln Asn Arg Val Val Ala Thr Thr Pro Leu Val Lys Asn Asn Lys Gly
515 520 525

Val Trp Gln Thr Ile Leu Asp Thr Lys Leu Gly Ile Lys Asn Tyr Thr
530 535 540

Gly Tyr Tyr Tyr Leu Tyr Glu Ile Lys Arg Gly Lys Asp Lys Val Lys
545 550 555 560

Ile Leu Asp Pro Tyr Ala Lys Ser Leu Ala Glu Trp Asp Ser Asn Thr
565 570 575

Val Asn Asp Asp Ile Lys Thr Ala Lys Ala Ala Phe Val Asn Pro Ser
580 585 590

Gln Leu Gly Pro Gln Asn Leu Ser Phe Ala Lys Ile Ala Asn Phe Lys
595 600 605

Gly Arg Gln Asp Ala Val Ile Tyr Glu Ala His Val Arg Asp Phe Thr
610 615 620

Ser Asp Arg Ser Leu Asp Gly Lys Leu Lys Asn Gln Phe Gly Thr Phe
625 630 635 640

Ala Ala Phe Ser Glu Lys Leu Asp Tyr Leu Gln Lys Leu Gly Val Thr
645 650 655

His Ile Gln Leu Leu Pro Val Leu Ser Tyr Phe Tyr Val Asn Glu Met
660 665 670

Asp Lys Ser Arg Ser Thr Ala Tyr Thr Ser Ser Asp Asn Asn Tyr Asn
675 680 685

Trp Gly Tyr Asp Pro Gln Ser Tyr Phe Ala Leu Ser Gly Met Tyr Ser
690 695 700

Glu Lys Pro Lys Asp Pro Ser Ala Arg Ile Ala Glu Leu Lys Gln Leu
705 710 715 720

Ile His Asp Ile His Lys Arg Gly Met Gly Val Ile Leu Asp Val Val
725 730 735

Tyr Asn His Thr Ala Lys Thr Tyr Leu Phe Glu Asp Ile Glu Pro Asn
740 745 750

Tyr Tyr His Phe Met Asn Glu Asp Gly Ser Pro Arg Glu Ser Phe Gly
755 760 765

Gly Gly Arg Leu Gly Thr Thr His Ala Met Ser Arg Arg Val Leu Val
770 775 780

Asp Ser Ile Lys Tyr Leu Thr Ser Glu Phe Lys Val Asp Gly Phe Arg
785 790 795 800

Phe Asp Met Met Gly Asp His Asp Ala Ala Ala Ile Glu Leu Ala Tyr
805 810 815

Lys Glu Ala Lys Ala Ile Asn Pro Asn Met Ile Met Ile Gly Glu Gly
820 825 830

Trp Arg Thr Phe Gln Gly Asp Gln Gly Gln Pro Val Lys Pro Ala Asp
835 840 845

Gln Asp Trp Met Lys Ser Thr Asp Thr Val Gly Val Phe Ser Asp Asp
850 855 860

Ile Arg Asn Ser Leu Lys Ser Gly Phe Pro Asn Glu Gly Thr Pro Ala
865 870 875 880

Phe Ile Thr Gly Gly Pro Gln Ser Leu Gln Gly Ile Phe Lys Asn Ile
885 890 895

Lys Ala Gln Pro Gly Asn Phe Glu Ala Asp Ser Pro Gly Asp Val Val
900 905 910

Gln Tyr Ile Ala Ala His Asp Asn Leu Thr Leu His Asp Val Ile Ala
915 920 925

Lys Ser Ile
930

<210> 27 

<211> 5607 

<212> DNA 

<213> streptococcus 

<220> 

<221> CDS 

<222> (2)...(301)

<400> 27 

a att caa agt ttg aca gaa ggt caa ctt cgt tct gat atc cct gag ttc 49
Ile Gln Ser Leu Thr Glu Gly Gln Leu Arg Ser Asp Ile Pro Glu Phe
1 5 10 15

cgt gct ggt gat act gta cgt gtt cac gct aaa gtt gtt gaa ggt act 97
Arg Ala Gly Asp Thr Val Arg Val His Ala Lys Val Val Glu Gly Thr
20 25 30

cgc gaa cgt att cag atc ttt gaa ggt gtt gtt atc tca cgt aaa ggt 145
Arg Glu Arg Ile Gln Ile Phe Glu Gly Val Val Ile Ser Arg Lys Gly
35 40 45

caa gga atc tca gaa atg tac aca gta cgt aaa att tct ggt ggt atc 193
Gln Gly Ile Ser Glu Met Tyr Thr Val Arg Lys Ile Ser Gly Gly Ile
50 55 60

ggt gta gag cgt aca ttc cca att cac act cct cgt gtt gat aaa atc 241
Gly Val Glu Arg Thr Phe Pro Ile His Thr Pro Arg Val Asp Lys Ile
65 70 75 80

gaa gtt gtt cgt tat ggt aaa gta cgt cgt gct aaa ctt tac tac tta 289
Glu Val Val Arg Tyr Gly Lys Val Arg Arg Ala Lys Leu Tyr Tyr Leu
85 90 95

cgc gca ttg caa ggtaaagctg cacgtattaa agaaatccgt cgttaatttt 341
Arg Ala Leu Gln
100

gatgatcaga ttttaaaaat gcttggttgt ttgaggatag taactatgtt ttaaaactgg 401

acaaccaaga cgtaaaaaat ctgcctgtgg gcagtttttt tactaggtcc ccttagttca 461

atggatataa caactccctc ctaaggagta attgctggtt cgattccggc aggggacata 521

ttcattgcat gtaaatagcg gtttagagct attttgcccc aaatttctct gattaagttt 581

atcgttccta tctttttgtt cttgtaattg atgtgcgtaa acttctaaag tgatatttaa 641

attctcgtga tctaaaactt gagagatgga aattagatag cttgcaaatg tatgcctgag 701

agagtgcact cgtacctcgc gaccagttat ttttcggata gttttattga ctgcattatt 761

tgaaagtttg tcgaataatc tgtcgttttt attttttgta aattcatgca aaaaaaataa 821

tgtatcattg tcaattggta tatttctgat actacttttg ttttttgttg gcaggtatct 881

ttggttgaaa tgataatccc aagttttatt aattgataaa tatttgttag tgtaatcaat 941

atcattaact gttaaaccta aacattcagc gaagcgcatg ccagttttag cgatgaggta 1001

taacgctgca tacgattgat gttgtgattt ttctttacaa atttttatca agcgtaagta 1061

ttcattggtt tcaagaaatt ttatctctat ttacgcccct tattttttgc tttaacctta 1121

gtgaataaac aaaaattttt ttctatatat ccctcgtgaa cagccatgga tacgcaggct 1181

tttacatgta tgttaaaacg ctttactgta tcttgcacat gcgtttgact ataatgattt 1241

atgacttgtt gatatttagt ggaagtaata ttgcaaagta atatatttcc tattatatgt 1301

ttatacgata ttcgatattc ccacccgttg tcgcgtttac ggaaatacgc cattgatata 1361

ctccacatta gctaaagaac agggtgttca aggctacctt gatggaaaag gctctcttag 1421

agatatttgt aaatggtatg atatctcaag tcgctctgtt ctccaaaagt ggataaaacg 1481

gtatactagt ggtgaagact tgaaagccac tagtagagga tatagccgta tgaaacaagg 1541

aaggcaagcc acatttgaag aacgtgtaga gattgttaac tacaccattg cccatgggaa 1601

agactatcaa gcagctattg agaagtttgg tgtttcctac caacaaattt attcttgggt 1661

gcgtaagctt gagaagaatg gctcacaagg tttggttgat agacgtgtga aagggttgga 1721

gagtaggcct gatttaaccg agattgagca actttaactc aagattaaac aattggagga 1781

acgtaatcgt ctcttagaaa tcgaggttag tttactaaaa aagttagaag acatcaaacg 1841

aggaaacaga cggtaagact aggtaagcat ttagcggagt tccaagtaat caagaattat 1901

tacgatgagg aatctaatgt gcctattcag gccttatgcc aactcttgaa ggggtctcgt 1961

tcaggctatt acaagtggct caatcgtcaa aaaacagatt ttgagacaaa aaatacaaag 2021

ctaatggcta aaatcaagga acttcgtaga ctctacaatg gtatcttagg ttatcgccgt 2081

atgacaacat ttattaatcg tcaacttggg acaacttaaa acaagaaacg gattcgttga 2141

ttgatgaaca ttctggggat tagttcagtc attcgtcgtg ttagccatgc ttgtacaaaa 2201

gctggtgaca gattttacga agaaaatatt cttaatcgtg aatttacagc cacagctcat 2261

aaccagaaat ggtgcacaga tgtcacctat cttcaatacg gtctgggagc taaagcttat 2321

ctcagtgcga ttaaagacct gtataacggt tctattatcg cttatgagat tagtcacaac 2381

aatgaaatcc acttgttatg aagaccatta aaaaggggct agagctcaat ccaggagcca 2441

cacctatcat ccatagcgat tgaggtagtc aatatacttc caaagaatac cgttatatca 2501

tacaacaagc tggtctgacc ttatccatgt cccggattgg caaatgtatt gataatgcac 2561

caactgaaag tttctttggg tttttcaaga ctgagtctta ccaccttaag aaatacaact 2621

cttatgatga gttggtcaat gatgtggcac gttatatcga attctacaac acacaacgtt 2681

atcaatcaaa attaaacaac ctgactcctc tagaattcag gaatcaggtt gcataactta 2741

tcttttatta tttgactgtc tacttgacag ggagccgttc agattgctta acctttctaa 2801

atttgctaaa atagctacaa gaaaacgagc catttaatgc ttatttctta tactgtcttg 2861

cctcacgctc tcctcgacca aaaattgagc gtgaggcttt ttgtttcatt aaacgatgat 2921

atttccatat tcatcagttt gttttccgag agccatcaaa gcttcgataa ggtcgataat 2981

tccaggaata aaggtaatac taaaaataat atataaaaaa acctggccta tttttcctgc 3041

gtaaaattta tgcgctccaa tgccgcccaa aagaacgtta ataaaacata aactactatg 3101

ttagcataag actttatttt tacaactgaa tttcatataa atggattaga gtaagggata 3161

aaagaaatta gcatagctct tttgaaaata aaaaaattaa tataatatgg aaaaaatttt 3221

atttcataaa cgtttcataa aaggtatgta atctagtatt taggcaacac tattttgtca 3281

ctggtgtcta gtaacttata gattgataat tttactagta aacgtaattc ttcgctttaa 3341

gagttaaatg tctatttatt gtaagctaaa ttgggaggtg aacttatgta aaattagata 3401

ggtactgtca agtacgggat gattattgaa acagccagta tgcatcataa aatctgtatt 3461

gcttaataac tatttcctta accagacatc agttcattgt ttatcatcgc taccctaagt 3521

ctagtttttt caatagagca ttaggtagtt tttgataata aaactatata aacatgagaa 3581

ttagatttcg tattgcattc ttcataatga gttatttgag attttccttt gaataaatag 3641

atacgaaatt cagtaacttc atatataaac ggctctatca ttgagatagt ttgtcaaatg 3701

aagaaatttt taatggaaat agttttaaaa acattagttg taggcgatgt aaaaatatta 3761

atccagtgga tgcaatagtt gcggagtaaa aatagagagg agtaattagg aagtgataaa 3821

aaatgctata gcatatatta ccagaaaaaa aaatagaaca cttattatat ttgctatttt 3881

aacaattgtt ctttcttgct tgtattcatg tttaacaata atgaaatcaa gtaatgaaat 3941

agaaaaggct ttatatgaaa gttctaattc ttcaatatca attacaaaaa aagatggtaa 4001

atattttaat attaatcaat ttaagaatat tgaaaaaata aaagaggttg aagaaaaaat 4061

atttcaatat gatggattag caaaattgaa agatcttaaa gtagttagtg gtgagcaaag 4121

tataaataga gaagatttat ctgacgaatt taaaaatgtt gtttcactag aagctacaag 4181

taatactaaa agaaatcttt tatttagtag tggagtattt agttttaaag aaggaaaaaa 4241

tatagaagaa aatgataaga attcaattct tgttcatgaa gaatttgcta aacaaaacaa 4301

actaaaattg ggtgatgaaa ttgatcttga attactagat acggaaaaaa gtggaaaaat 4361

aaaaagtcat aaatttaaaa ttataggaat cttttctggt aaaaaacagg aaacatatac 4421

aggattatca tctgatttta gcgaaaatat ggtttttgta gattattcaa ctagccaaga 4481

aatattaaat aaatcagaga ataatagaat tgcaaataaa attttaatgt attctggtag 4541

tttagaatct acagagcttg ccttaaacaa attgaaagac tttaaaattg ataagtcaaa 4601

gtattctatt aagaaagata ataaagcatt cgaagagtct ttagagtcag tgagtggaat 4661

aaaacatata attaaaataa tgacttattc gattatgtta ggtggaatag ttgttctttc 4721

attaatcttg attctatggt taagagaaag aatttatgaa ataggtatat ttttatctat 4781

tggaacaact aagatacaaa ttataaggca atttatattt gagttaatat tcatatcaat 4841

accaagtata atatcctcct tatttttagg gaatctacta ttaaaagtaa ttgtagaagg 4901

atttattaac tcagagaact caatgatttt cggtggaagt ttaataaata aaagcagttt 4961

tatgttaaac ataacaacac ttgcagaaag ttatttaata ttaataagta ttattgtttt 5021

atcagttgta atggcctctt cattaatatt atttaagaaa ccacaagaaa tattatcaaa 5081

aataagttag gagcaaataa tggatatatt agaaataaag aatgtaaatt acagttacgc 5141

aaattctaaa gaaaaagttt tgtcaggagt aaatcaaaaa tttgaacttg gaaagtttta 5201

tgcgatagta gggaagtcag gaacaggaaa atccacactt ctttccttac ttgcaggact 5261

tgataaagtt caaacaggaa aaatcttgtt taagaatgaa gatatagaaa agaaaggata 5321

tagtaatcac agaaaaaata atatatcttt ggtatttcaa aattataatt taatagatta 5381

tttatcgccg attgaaaata ttagactagt aaataaatca gtagatgaga gtatcttgtt 5441

cgaattaggt ttagataaaa aacaaataaa aagaaatgtt atgaaattat ctggtggtca 5501

gcaacaaagg gtagctattg ctagggcact ggtatcagat gccccaataa tactagctga 5561

tgagcctacc ggtaacctag acagtgttac tgctggagaa ataatt 5607



<210> 28 

<211> 111 
<212> PRT 

<213> streptococcus 



<400> 28 



Ile Gln Ser Leu Thr Glu Gly Gln Leu Arg Ser Asp Ile Pro Glu Phe
1 5 10 15

Arg Ala Gly Asp Thr Val Arg Val His Ala Lys Val Val Glu Gly Thr
20 25 30

Arg Glu Arg Ile Gln Ile Phe Glu Gly Val Val Ile Ser Arg Lys Gly
35 40 45

Gln Gly Ile Ser Glu Met Tyr Thr Val Arg Lys Ile Ser Gly Gly Ile
50 55 60

Gly Val Glu Arg Thr Phe Pro Ile His Thr Pro Arg Val Asp Lys Ile
65 70 75 80

Glu Val Val Arg Tyr Gly Lys Val Arg Arg Ala Lys Leu Tyr Tyr Leu
85 90 95

Arg Ala Leu Gln Gly Lys Ala Ala Arg Ile Lys Glu Ile Arg Arg
100 105 110



<210> 29 
<211> 173 
<212> PRT 

<213> streptococcus 



<400> 29 

Met Arg Phe Ala Glu Cys Leu Gly Leu Thr Val Asn Asp Ile Asp Tyr
1 5 10 15

Thr Asn Lys Tyr Leu Ser Ile Asn Lys Thr Trp Asp Tyr His Phe Asn
20 25 30

Gln Arg Tyr Leu Pro Thr Lys Asn Lys Ser Ser Ile Arg Asn Ile Pro
35 40 45

Ile Asp Asn Asp Thr Leu Phe Phe Leu His Glu Phe Thr Lys Asn Lys
50 55 60

Asn Asp Arg Leu Phe Asp Lys Leu Ser Asn Asn Ala Val Asn Lys Thr
65 70 75 80

Ile Arg Lys Ile Thr Gly Arg Glu Val Arg Val His Ser Leu Arg His
85 90 95

Thr Phe Ala Ser Tyr Leu Ile Ser Ile Ser Gln Val Leu Asp His Glu
100 105 110

Asn Leu Asn Ile Thr Leu Glu Val Tyr Ala His Gln Leu Gln Glu Gln
115 120 125

Lys Asp Arg Asn Asp Lys Leu Asn Gln Arg Asn Leu Gly Gln Asn Ser
130 135 140

Ser Lys Pro Leu Phe Thr Cys Asn Glu Tyr Val Pro Cys Arg Asn Arg
145 150 155 160

Thr Ser Asn Tyr Ser Leu Gly Gly Ser Cys Tyr Ile His
165 170

<210> 30
<211> 389
<212> PRT
<213> streptococcus

<400> 30

Met Lys Ser Ser Asn Glu Ile Glu Lys Ala Leu Tyr Glu Ser Ser Asn
1 5 10 15

Ser Ser Ile Ser Ile Thr Lys Lys Asp Gly Lys Tyr Phe Asn Ile Asn
20 25 30

Gln Phe Lys Asn Ile Glu Lys Ile Lys Glu Val Glu Glu Lys Ile Phe
35 40 45

Gln Tyr Asp Gly Leu Ala Lys Leu Lys Asp Leu Lys Val Val Ser Gly
50 55 60

Glu Gln Ser Ile Asn Arg Glu Asp Leu Ser Asp Glu Phe Lys Asn Val
65 70 75 80

Val Ser Leu Glu Ala Thr Ser Asn Thr Lys Arg Asn Leu Leu Phe Ser
85 90 95

Ser Gly Val Phe Ser Phe Lys Glu Gly Lys Asn Ile Glu Glu Asn Asp
100 105 110

Lys Asn Ser Ile Leu Val His Glu Glu Phe Ala Lys Gln Asn Lys Leu
115 120 125

Lys Leu Gly Asp Glu Ile Asp Leu Glu Leu Leu Asp Thr Glu Lys Ser
130 135 140

Gly Lys Ile Lys Ser His Lys Phe Lys Ile Ile Gly Ile Phe Ser Gly
145 150 155 160

Lys Lys Gln Glu Thr Tyr Thr Gly Leu Ser Ser Asp Phe Ser Glu Asn
165 170 175

Met Val Phe Val Asp Tyr Ser Thr Ser Gln Glu Ile Leu Asn Lys Ser
180 185 190

Glu Asn Asn Arg Ile Ala Asn Lys Ile Leu Met Tyr Ser Gly Ser Leu
195 200 205

Glu Ser Thr Glu Leu Ala Leu Asn Lys Leu Lys Asp Phe Lys Ile Asp
210 215 220

Lys Ser Lys Tyr Ser Ile Lys Lys Asp Asn Lys Ala Phe Glu Glu Ser
225 230 235 240

Leu Glu Ser Val Ser Gly Ile Lys His Ile Ile Lys Ile Met Thr Tyr
245 250 255

Ser Ile Met Leu Gly Gly Ile Val Val Leu Ser Leu Ile Leu Ile Leu
260 265 270

Trp Leu Arg Glu Arg Ile Tyr Glu Ile Gly Ile Phe Leu Ser Ile Gly
275 280 285

Thr Thr Lys Ile Gln Ile Ile Arg Gln Phe Ile Phe Glu Leu Ile Phe
290 295 300

Ile Ser Ile Pro Ser Ile Ile Ser Ser Leu Phe Leu Gly Asn Leu Leu
305 310 315 320

Leu Lys Val Ile Val Glu Gly Phe Ile Asn Ser Glu Asn Ser Met Ile
325 330 335

Phe Gly Gly Ser Leu Ile Asn Lys Ser Ser Phe Met Leu Asn Ile Thr
340 345 350

Thr Leu Ala Glu Ser Tyr Leu Ile Leu Ile Ser Ile Ile Val Leu Ser
355 360 365

Val Val Met Ala Ser Ser Leu Ile Leu Phe Lys Lys Pro Gln Glu Ile
370 375 380

Leu Ser Lys Ile Ser
385



<210> 31 

<211> 169 

<212> PRT 

<213> streptococcus 



<400> 31 
Met Asp Ile Leu Glu Ile Lys Asn Val Asn Tyr Ser Tyr Ala Asn Ser
1 5 10 15

Lys Glu Lys Val Leu Ser Gly Val Asn Gln Lys Phe Glu Leu Gly Lys
20 25 30

Phe Tyr Ala Ile Val Gly Lys Ser Gly Thr Gly Lys Ser Thr Leu Leu
35 40 45

Ser Leu Leu Ala Gly Leu Asp Lys Val Gln Thr Gly Lys Ile Leu Phe
50 55 60

Lys Asn Glu Asp Ile Glu Lys Lys Gly Tyr Ser Asn His Arg Lys Asn
65 70 75 80

Asn Ile Ser Leu Val Phe Gln Asn Tyr Asn Leu Ile Asp Tyr Leu Ser
85 90 95

Pro Ile Glu Asn Ile Arg Leu Val Asn Lys Ser Val Asp Glu Ser Ile
100 105 110

Leu Phe Glu Leu Gly Leu Asp Lys Lys Gln Ile Lys Arg Asn Val Met
115 120 125

Lys Leu Ser Gly Gly Gln Gln Gln Arg Val Ala Ile Ala Arg Ala Leu
130 135 140

Val Ser Asp Ala Pro Ile Ile Leu Ala Asp Glu Pro Thr Gly Asn Leu
145 150 155 160

Asp Ser Val Thr Ala Gly Glu Ile Ile
165



<210> 32 
<211> 4171 
<212> DNA 

<213> Streptococcus 



<400> 32 

catatgacaa tatttttcaa agtctacatc acttactcgc ctgtcgtgga aaatctggca 60 

atacattaat cgaccaatta gttgctgatg gtttacttca tgcagataat cactaccatt 120 

ttttcaatgg gaagtctctg gccactttca atactaacca attgattcgc gaagttgtct 180 

atgttgaaat atccttagat actatgtcta gtggtgaaca tgatttagta aaagttaaca 240 
ttatcagacc cactaccgag catactatcc ccacgatgat gacagctagc ccctatcatc 300

aaggtatcaa tgatcctgcc gcagaccaaa aaacatacca aatggagggt gcgctagcag 360 

ttaaacagcc taaacacata caagttgaca caaaaccatt taaagaagaa gtaaaacatc 420 

cttcaaaatt acccatcagc cctgcaactg aaagcttcac acacattgac agttatagtc 480 

tcaatgacta ttttctttct cgtggttttg ctaatatata cgtttcaggt gtgggtactg 540 

ctggctctac gggtttcatg accagtgggg attaccaaca aatacaaagc tttaaagcag 600 

tcattgattg gttaaatggt aaggttactg cattcacaag tcataaacga gataaacaag 660 

tcaaggctga ttggtcaaac ggccttgtag caaccacagg taaatcttat ctcggtacca 720 

tgtcaactgg tttagcaaca actggcgttg aggggctgaa agtcattatc gctgaagccg 780 

caatctccac atggtatgat tattatcgag aaaatgggct tgtgtgtagt ccaggcggct 840 

accccggtga agatttagac gttttaacag aattaacata ctcacgaaac ctcttagctg 900 

gtgattacat caaaaacaac gattgctatc aagcattgtt aaatgaacaa tcaaaagcaa 960 

ttgaccgtca aagtggggat tacaaccaat actggcatga ccgtaattac ctaactcacg 1020 

tcaataatgt caaaagtcga gtagtttaca ctcatggact acaggattgg aatgttaagc 1080 

caagacatgt ctacaaagtt ttcaatgcat tgcctcaaac catcaaaaaa cacctttttt 1140 

tacatcaagg tcaacatgtg tatatgcata attggcagtc gattgatttt cgtgaaagca 1200 

tgaatgcctt actaagccaa gaactacttg gcattgacaa tcatttccaa ttagaagagg 1260 

tcatttggca agataatact actgagcaaa cttggcaagt tttagatgct ttcggaggaa 1320 

accatcaaga gcaaattggt ttaggtgata gtaaaaaact tattgataac cattatgaca 1380 

aagaagcctt tgatacttat tgtaaagact tcaatgtgtt caaaaatgat cttttcaagg 1440 

gaaataataa aaccaatcaa atcactatta atcttcctct aaagaaaaat tatctcctga 1500 

atggacagtg caaactccat ctacgtgtta aaactagtga caaaaaggcc attttatcag 1560 

cccaaatctt agactatggt cctaaaaaac gattcaaaga tacaccaacc atcaaattct 1620 

taaacagcct tgataatggt aaaaattttg ccagagaagc tttacgtgaa ctcccgttta 1680 

ctaaagatca ttatcgtgtc atcagtaaag gtgtcttgaa ccttcaaaat cgtacagact 1740 

tacttacaat tgaggctatc gagccagaac aatggtttga tatcgagttt agcctccaac 1800 

caagtatata tcaattgagt aaaggtgata atctaaggat tatcctttat acaactgatt 1860 

ttgaacatac cattcgagat aatgctagtt actctataac agtagatttg agtcaatctt 1920 

atttaactat cccaactaat caaggaaatt aacttatgaa acttcttact aaagaacggt 1980 

ttgatgattc tcaacacttt tggtaccaga tcaatttatt acaagagagt aacttcggag 2040 

cagtttttga ccatgataat aaaaacattc cacaggttgt tgcaactatt gttgatgatt 2100 

tacaaggttc cggaagttcg aatcatttct ggtattttgg caatactact gatacttcca 2160 

tccttatgat tgctcattta aatcgaaaat tctatattca ggttaattta aaggactttg 2220 

actttgcact caatttaata gctataaata attggaagag tctcctccaa actcaacttg 2280 

aagctctaaa cgatacccta gcaatatttc aataaataag gtagaatgga gtgacaaagc 2340 

aacgcgaggg agactgatta atgtcatctt attggaataa ctatcctgaa cttaaaaaaa 2400 

atattgatga aaccaatcaa ctaattcaag aaagaataca ggtcagaaat aaagatattg 2460 

aagcggcgct aagccaactc acagctgcgg gaggaaaaca gctcagacca gcattctttt 2520 

accttttttc tcaacttggt aataaggaga atcaagatac tcagcaacta aagaaaatcg 2580 

ctgcttcttt agaaatcctt cacgttgcta cattaatcca tgatgatgtc attgatgact 2640 

caccactaag acgtggaaat atgaccattc aaagcaagtt tggcaaagac atcgcagttt 2700 

atactgggga tttacttttc acagtctttt tcgatcttat tttagaatct atgactgata 2760 

caccatttat gaggattaat gcaaaatcta tgcgtaaaat tctcatggga gaattggacc 2820

agatgcacct tcgttacaat caacaacaag gtatccatca ctatttacgt gcgatttcag 2880 

gtaagacagc cgaactcttt aaattagcta gcaaagaagg agcttacttt ggtggtgcag 2940 

agaaggaggt tgttcgtcta gcaggccata tcggctttaa cattggtatg acattccaaa 3000 

ttttggatga tatcctggat tatactgcag ataaaaaaac atttaataag cctgtcttag 3060 

aggatttaac acaaggcgtt tacagccttc ctctacttct tgccattgaa gaaaatcctg 3120 

atattttcaa acctatttta gataaaaaaa cagatatggc tactgaagac atggaaaaaa 3180 

ttgcttatct cgtcgtttcc catagaggtg ttgacaaagc tcgccatcta gctcgtaaat 3240 

ttactgagaa agctattagt gacataaata agctacccca gaactctgca aaaaaacagt 3300 

tgctacaatt aactaattac cttttaaaac gcaaaattta aataataaaa aaacattcca 3360 

caatgctaga aaagcagtta gggaatgttt ttttattatc atttatttat cgcacctatc 3420 

aatcatcata gatcaccatc atcagcggct ttcagctgac ggtaacgttg actactttga 3480 

gacaattctt gaggagaacc ttccaactct aattgcccat tttctataaa taagatacga 3540 

tcagcatgtt caataccttt taagtgatgt gtaatccaaa ctaaggtctt accttccaat 3600

tctttcataa atacccttag taaggcttgt tcagtaatag gatcaagtcc aacagttggc 3660

tcatctaaga taacaattgg gacatctttt agtaagattc tagccaaagc aattctatgc 3720

ctttcgccac ctgaaaacct aagtccagct tcatcaacca ttgtatagag accatctgat 3780

aaatcagtga ccatctcttt caatccaact cgttcaagaa ctttccatac atcttcttca 3840

ctagcatctt ggtttccaat gcgaatgtta tttagcaggg ttgtattaaa aaggtagggc 3900

gcttgttgta tcactccaat atagttagaa atgcaatcac caactattga aacatcagca 3960

ccgcctaggg taatcttccc ttgacttgct ttcaagtcgc cacgaagtag actagctaag 4020

gtactcttgc cagaaccact ccgccctaaa atagcaattt tttctccttc tttaatatcc 4080

aaatctaaat gatgcaaaac ccatttctct tgtggcttat actggaaact taaattcttg 4140

acggaaaaat catatggctt attaggcaat 4171



<210> 33
<211> 649
<212> PRT
<213> Streptococcus

<400> 33
Tyr Asp Asn Ile Phe Gln Ser Leu His His Leu Leu Ala Cys Arg Gly
1 5 10 15

Lys Ser Gly Asn Thr Leu Ile Asp Gln Leu Val Ala Asp Gly Leu Leu
20 25 30

His Ala Asp Asn His Tyr His Phe Phe Asn Gly Lys Ser Leu Ala Thr
35 40 45

Phe Asn Thr Asn Gln Leu Ile Arg Glu Val Val Tyr Val Glu Ile Ser
50 55 60

Leu Asp Thr Met Ser Ser Gly Glu His Asp Leu Val Lys Val Asn Ile
65 70 75 80

Ile Arg Pro Thr Thr Glu His Thr Ile Pro Thr Met Met Thr Ala Ser
85 90 95

Pro Tyr His Gln Gly Ile Asn Asp Pro Ala Ala Asp Gln Lys Thr Tyr
100 105 110

Gln Met Glu Gly Ala Leu Ala Val Lys Gln Pro Lys His Ile Gln Val
115 120 125

Asp Thr Lys Pro Phe Lys Glu Glu Val Lys His Pro Ser Lys Leu Pro
130 135 140

Ile Ser Pro Ala Thr Glu Ser Phe Thr His Ile Asp Ser Tyr Ser Leu
145 150 155 160

Asn Asp Tyr Phe Leu Ser Arg Gly Phe Ala Asn Ile Tyr Val Ser Gly
165 170 175

Val Gly Thr Ala Gly Ser Thr Gly Phe Met Thr Ser Gly Asp Tyr Gln
180 185 190

Gln Ile Gln Ser Phe Lys Ala Val Ile Asp Trp Leu Asn Gly Lys Val
195 200 205

Thr Ala Phe Thr Ser His Lys Arg Asp Lys Gln Val Lys Ala Asp Trp
210 215 220

Ser Asn Gly Leu Val Ala Thr Thr Gly Lys Ser Tyr Leu Gly Thr Met
225 230 235 240

Ser Thr Gly Leu Ala Thr Thr Gly Val Glu Gly Leu Lys Val Ile Ile
245 250 255

Ala Glu Ala Ala Ile Ser Thr Trp Tyr Asp Tyr Tyr Arg Glu Asn Gly
260 265 270

Leu Val Cys Ser Pro Gly Gly Tyr Pro Gly Glu Asp Leu Asp Val Leu
275 280 285

Thr Glu Leu Thr Tyr Ser Arg Asn Leu Leu Ala Gly Asp Tyr Ile Lys
290 295 300

Asn Asn Asp Cys Tyr Gln Ala Leu Leu Asn Glu Gln Ser Lys Ala Ile
305 310 315 320

Asp Arg Gln Ser Gly Asp Tyr Asn Gln Tyr Trp His Asp Arg Asn Tyr
325 330 335

Leu Thr His Val Asn Asn Val Lys Ser Arg Val Val Tyr Thr His Gly
340 345 350

Leu Gln Asp Trp Asn Val Lys Pro Arg His Val Tyr Lys Val Phe Asn
355 360 365

Ala Leu Pro Gln Thr Ile Lys Lys His Leu Phe Leu His Gln Gly Gln
370 375 380

His Val Tyr Met His Asn Trp Gln Ser Ile Asp Phe Arg Glu Ser Met
385 390 395 400

Asn Ala Leu Leu Ser Gln Glu Leu Leu Gly Ile Asp Asn His Phe Gln
405 410 415

Leu Glu Glu Val Ile Trp Gln Asp Asn Thr Thr Glu Gln Thr Trp Gln
420 425 430

Val Leu Asp Ala Phe Gly Gly Asn His Gln Glu Gln Ile Gly Leu Gly
435 440 445

Asp Ser Lys Lys Leu Ile Asp Asn His Tyr Asp Lys Glu Ala Phe Asp
450 455 460

Thr Tyr Cys Lys Asp Phe Asn Val Phe Lys Asn Asp Leu Phe Lys Gly
465 470 475 480

Asn Asn Lys Thr Asn Gln Ile Thr Ile Asn Leu Pro Leu Lys Lys Asn
485 490 495

Tyr Leu Leu Asn Gly Gln Cys Lys Leu His Leu Arg Val Lys Thr Ser
500 505 510

Asp Lys Lys Ala Ile Leu Ser Ala Gln Ile Leu Asp Tyr Gly Pro Lys
515 520 525

Lys Arg Phe Lys Asp Thr Pro Thr Ile Lys Phe Leu Asn Ser Leu Asp
530 535 540

Asn Gly Lys Asn Phe Ala Arg Glu Ala Leu Arg Glu Leu Pro Phe Thr
545 550 555 560

Lys Asp His Tyr Arg Val Ile Ser Lys Gly Val Leu Asn Leu Gln Asn
565 570 575

Arg Thr Asp Leu Leu Thr Ile Glu Ala Ile Glu Pro Glu Gln Trp Phe
580 585 590

Asp Ile Glu Phe Ser Leu Gln Pro Ser Ile Tyr Gln Leu Ser Lys Gly
595 600 605

Asp Asn Leu Arg Ile Ile Leu Tyr Thr Thr Asp Phe Glu His Thr Ile
610 615 620

Arg Asp Asn Ala Ser Tyr Ser Ile Thr Val Asp Leu Ser Gln Ser Tyr
625 630 635 640

Leu Thr Ile Pro Thr Asn Gln Gly Asn
645

<210> 34
<211> 119
<212> PRT
<213> Streptococcus

<400> 34
Met Lys Leu Leu Thr Lys Glu Arg Phe Asp Asp Ser Gln His Phe Trp
1 5 10 15

Tyr Gln Ile Asn Leu Leu Gln Glu Ser Asn Phe Gly Ala Val Phe Asp
20 25 30

His Asp Asn Lys Asn Ile Pro Gln Val Val Ala Thr Ile Val Asp Asp
35 40 45

Leu Gln Gly Ser Gly Ser Ser Asn His Phe Trp Tyr Phe Gly Asn Thr
50 55 60

Thr Asp Thr Ser Ile Leu Met Ile Ala His Leu Asn Arg Lys Phe Tyr
65 70 75 80

Ile Gln Val Asn Leu Lys Asp Phe Asp Phe Ala Leu Asn Leu Ile Ala
85 90 95

Ile Asn Asn Trp Lys Ser Leu Leu Gln Thr Gln Leu Glu Ala Leu Asn
100 105 110

Asp Thr Leu Ala Ile Phe Gln
115

<210> 35
<211> 326
<212> PRT
<213> Streptococcus

<400> 35
Met Ser Ser Tyr Trp Asn Asn Tyr Pro Glu Leu Lys Lys Asn Ile Asp
1 5 10 15

Glu Thr Asn Gln Leu Ile Gln Glu Arg Ile Gln Val Arg Asn Lys Asp
20 25 30

Ile Glu Ala Ala Leu Ser Gln Leu Thr Ala Ala Gly Gly Lys Gln Leu
35 40 45

Arg Pro Ala Phe Phe Tyr Leu Phe Ser Gln Leu Gly Asn Lys Glu Asn
50 55 60

Gln Asp Thr Gln Gln Leu Lys Lys Ile Ala Ala Ser Leu Glu Ile Leu
65 70 75 80

His Val Ala Thr Leu Ile His Asp Asp Val Ile Asp Asp Ser Pro Leu
85 90 95

Arg Arg Gly Asn Met Thr Ile Gln Ser Lys Phe Gly Lys Asp Ile Ala
100 105 110

Val Tyr Thr Gly Asp Leu Leu Phe Thr Val Phe Phe Asp Leu Ile Leu
115 120 125

Glu Ser Met Thr Asp Thr Pro Phe Met Arg Ile Asn Ala Lys Ser Met
130 135 140

Arg Lys Ile Leu Met Gly Glu Leu Asp Gln Met His Leu Arg Tyr Asn
145 150 155 160

Gln Gln Gln Gly Ile His His Tyr Leu Arg Ala Ile Ser Gly Lys Thr
165 170 175

Ala Glu Leu Phe Lys Leu Ala Ser Lys Glu Gly Ala Tyr Phe Gly Gly
180 185 190

Ala Glu Lys Glu Val Val Arg Leu Ala Gly His Ile Gly Phe Asn Ile
195 200 205

Gly Met Thr Phe Gln Ile Leu Asp Asp Ile Leu Asp Tyr Thr Ala Asp
210 215 220

Lys Lys Thr Phe Asn Lys Pro Val Leu Glu Asp Leu Thr Gln Gly Val
225 230 235 240

Tyr Ser Leu Pro Leu Leu Leu Ala Ile Glu Glu Asn Pro Asp Ile Phe
245 250 255

Lys Pro Ile Leu Asp Lys Lys Thr Asp Met Ala Thr Glu Asp Met Glu
260 265 270

Lys Ile Ala Tyr Leu Val Val Ser His Arg Gly Val Asp Lys Ala Arg
275 280 285

His Leu Ala Arg Lys Phe Thr Glu Lys Ala Ile Ser Asp Ile Asn Lys
290 295 300

Leu Pro Gln Asn Ser Ala Lys Lys Gln Leu Leu Gln Leu Thr Asn Tyr






305 310 315 320

Leu Leu Lys Arg Lys Ile
325



<210> 36
<211> 247
<212> PRT
<213> Streptococcus

<400> 36
Leu Pro Asn Lys Pro Tyr Asp Phe Ser Val Lys Asn Leu Ser Phe Gln
1 5 10 15

Tyr Lys Pro Gln Glu Lys Trp Val Leu His His Leu Asp Leu Asp Ile
20 25 30

Lys Glu Gly Glu Lys Ile Ala Ile Leu Gly Arg Ser Gly Ser Gly Lys
35 40 45

Ser Thr Leu Ala Ser Leu Leu Arg Gly Asp Leu Lys Ala Ser Gln Gly
50 55 60

Lys Ile Thr Leu Gly Gly Ala Asp Val Ser Ile Val Gly Asp Cys Ile
65 70 75 80

Ser Asn Tyr Ile Gly Val Ile Gln Gln Ala Pro Tyr Leu Phe Asn Thr
85 90 95

Thr Leu Leu Asn Asn Ile Arg Ile Gly Asn Gln Asp Ala Ser Glu Glu
100 105 110

Asp Val Trp Lys Val Leu Glu Arg Val Gly Leu Lys Glu Met Val Thr
115 120 125

Asp Leu Ser Asp Gly Leu Tyr Thr Met Val Asp Glu Ala Gly Leu Arg
130 135 140

Phe Ser Gly Gly Glu Arg His Arg Ile Ala Leu Ala Arg Ile Leu Leu
145 150 155 160

Lys Asp Val Pro Ile Val Ile Leu Asp Glu Pro Thr Val Gly Leu Asp
165 170 175

Pro Ile Thr Glu Gln Ala Leu Leu Arg Val Phe Met Lys Glu Leu Glu
180 185 190

Gly Lys Thr Leu Val Trp Ile Thr His His Leu Lys Gly Ile Glu His
195 200 205

Ala Asp Arg Ile Leu Phe Ile Glu Asn Gly Gln Leu Glu Leu Glu Gly
210 215 220

Ser Pro Gln Glu Leu Ser Gln Ser Ser Gln Arg Tyr Arg Gln Leu Lys
225 230 235 240

Ala Ala Asp Asp Gly Asp Leu
245

<210> 37
<211> 3480
<212> DNA
<213> Streptococcus

<400> 37

aattctattt ggaggttttt cttgaataaa tggttagtta aggcaagttc cttagttgtt 60

ttaggtggta tggttttatc tgcgggttcc cgagttttag cggatactta tgtccgtcca 120

attgataatg gtagaattac aacaggtttc aatggttatc ctggacattg tggggtggat 180

tatgctgttc cgactggaac gattattagg gcagtggcag atggtactgt gaaatttgca 240

ggagctggag ccaacttttc ttggatgaca gacttagcag gaaattgtgt catgattcaa 300

catgcggatg gaatgcatag tggttacgct catatgtcac gtgtggtggc taggactggg 360

gaaaaagtca aacaaggaga tatcatcggt tacgtaggag caactggtat ggcgacggga 420

cctcaccttc attttgaatt tttaccagct aaccctaatt ttcaaaatgg tttccatgga 480

cgtatcaatc caacgtcact aattgctaac gttgcgacct ttagtggaaa aacgcaagca 540

tcagctccaa gcattaagcc attacaatca gctcctgtac agaatcaatc tagtaaatta 600

aaagtgtatc gagtagatga attacaaaag gttaatggtg tttggttagt caaaaataac 660

accctaacgc cgactgggtt tgattggaac gataatggta taccagcatc agaaattgat 720

gaggttgatg ctaatggtaa tttgacagct gaccaggttc ttcaaaaagg tggttacttt 780

atctttaatc ctaaaactct taagactgta gaaaaaccca tccaaggaac agctggttta 840

acttgggcta agacacgctt tgctaatggt agttcagttt ggcttcgcgt tgacaacagt 900

caagaactgc tttacaaata gtttgaggta ttgattcatt gttttaaatg acagttttgt 960

tactaactaa gtacaatttc tttaaaccgt ctgaaaataa ttttatagtc cagtaaagtg 1020

tgatattata gtctcggact aataaaaagg aaataggaat tgaagcaatg aaaatgaata 1080

aaaaggtact attgacatcg acaatggcag cttcgctatt atcagtcgca agtgttcaag 1140

cacaagaaac agatacgacg tggacagcac gtactgtttc agaggtaaag gctgatttgg 1200

taaagcaaga caataaatca tcatatactg tgaaatatgg tgatacacta agcgttattt 1260

cagaagcaat gtcaattgat atgaatgtct tagcaaaaat taataacatt gcagatatca 1320

atcttattta tcctgagaca acactgacag taacttacga tcagaagagt catactgcca 1380

cttcaatgaa aatagaaaca ccagcaacaa atgctgctgg tcaaacaaca gctactgtgg 1440

atttgaaaac caatcaagtt tctgttgcag accaaaaagt ttctctcaat acaatttcgg 1500

aaggtatgac accagaagca gcaacaacga ttgtttcgcc aatgaagaca tattcttctg 1560

cgccagcttt gaaatcaaaa gaagtattag cacaagagca agctgttagt caagcagcag 1620

ctaatgaaca ggtatcaaca gctcctgtga agtcgattac ttcagaagtt ccagcagcta 1680

aagaggaagt taaaccaact cagacgtcag tcagtcagtc aacaacagta tcaccagctt 1740

ctgttgccgc tgaaacacca gctccagtag ctaaagtagc accggtaaga actgtagcag 1800

cccctagagt ggcaagtgtt aaagtagtca ctcctaaagt agaaactggt gcatcaccag 1860

agcatgtatc agctccagca gttcctgtga ctacgacttc aacagctaca gacagtaagt 1920

tacaagcgac tgaagttaag agcgttccgg tagcacaaaa agctccaaca gcaacaccgg 1980

tagcacaacc agcttcaaca acaaatgcag tagctgcaca tcctgaaaat gcagggctcc 2040

aacctcatgt tgcagcttat aaagaaaaag tagcgtcaac ttatggagtt aatgaattca 2100

gtacataccg tgcaggtgat ccaggtgatc atggtaaagg tttagcagtc gactttattg 2160

taggtaaaaa ccaagcactt ggtaatgaag ttgcacagta ctctacacaa aatatggcag 2220

caaataacat ttcatatgtt atctggcaac aaaagtttta ctcaaataca aatagtattt 2280

atggacctgc taatacttgg aatgcaatgc cagatcgtgg tggcgttact gccaaccatt 2340

atgaccatgt tcacgtatca tttaacaaat aatataaaaa aggaagctat ttggcttctt 2400

ttttatatgc cttgaataga ctttcaaggt tcttatctaa tttttattaa attgaggaga 2460

ttaagctata agtctgaaac tactttcacg ttaaccgtga ctaaatcaaa acgttaaaac 2520

taaaatctaa gtctgtaaag attattgaaa acgctttaaa aacagatata ataaggtttg 2580

tagatatcta aaattaaaaa agataaggaa gtgagaatat gccacatcta agtaaagaag 2640

cttttaaaaa gcaaataaaa aatggcatta ttgtgtcatg tcaagctttg cctggggagc 2700

ctctttatac tgaaagtgga ggtgttatgc ctcttttagc tttggcagct caagaagcag 2760

gagcggttgg tataagagcc aatagtgtcc gcgacattaa ggaaattcaa gaagttacta 2820

atttacctat catcggcatt attaaacgtg aatatcctcc acaagaacca tttatcactg 2880

ctacgatgac agaggtggat caattagcta gtttagatat tgcagtaata gccttagatt 2940

gtacacttag agagcgtcat gatggtttga gtgtagctga gtttattcaa aagataaaag 3000

ggaaatatcc tgaacagttg ctaatggctg atataagtac ttttgaagaa ggtaaaaatg 3060

cttttgaagc aggagttgat tttgtgggta caactctatc tggatacaca gattacagcc 3120

gccaagaaga aggaccggat atagaactcc ttaataagct ttgtcaagcc ggtatagatg 3180

tgattgcgga aggtaaaatt catactccta agcaagctaa tgaaattaat catataggtg 3240

ttgcaggaat tgtagttggt ggtgctatca ctagaccaaa agaaatagcg gagcgtttca 3300

tctcaggact tagttaaaag tgttactcaa aaatcaaaat caaaataaaa aaggggaata 3360

gttatgagta tcaaaaaaag tgtgattggt ttttgcctcg gagctgcagc attatcaatg 3420

tttgcttgtg tagacagtag tcaatctgtt atggctgccg agaaggataa agtcgaaatt 3480



<210> 38 
<211> 306 
<212> PRT 

<213> Streptococcus 
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Ala 
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Val 
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3 5 










40 


Gly 


Phe 
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Gly Tyr 


Pro 


Gly 


His 




50 
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Thr 


Gly 


Thr 


He 


He 


Arg 


Ala 


Val 


65 










70 






Gly 


Ala 


Gly 


Ala 


Asn 


Phe 


Ser 


Trp 










85 








Val 


Met 


He 


Gin 


His 


Ala 


Asp 


Gly 








100 










Ser 


Arg 


Val 


Val 


Ala 


Arg 


Thr 


Gly 
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120 
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Gly 


Tyr 


Val 


Gly Ala 


Thr 


Gly 




130 










135 




Phe 


Glu 


Phe 


Leu 


Pro 


Ala 


Asn 


Pro 


145 










150 






Arg 


lie 


Asn 


Pro 


Thr 


Ser 


Leu 


He 










165 








Lys 


Thr 


Gin 


Ala 


Ser 


Ala 


Pro 


Ser 








180 










Val 


Gin 


Asn 


Gin 


Ser 


Ser 


Lys 


Leu 






195 










200 


Gin 


Lys 


Val 


Asn 


Gly 


Val 


Trp 


Leu 




210 










215 




Thr 


Gly 


Phe 


Asp 


Trp 


Asn 


Asp 


Asn 


225 










230 






Glu 


Val 


Asp 


Ala 


Asn 


Gly Asn 


Leu 










245 








Gly 


Gly 


Tyr 


Phe 


He 


Phe 


Asn 


Pro 








260 










Pro 


He 


Gin 


Gly Thr Ala 


Gly 


Leu 






275 










280 


Asn 


Gly 


Ser 


Ser 


Val 


Trp 


Leu 


Arg 




290 










295 




Tyr 


Lys 
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39 












<211> 


434 












<212> 


PRT 












<213> 


Streptococcus 
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Met 


Lys 


Met 


Asn 


Lys 


Lys 


Val 


Leu 
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5 








Leu 


Leu 


Ser 


Val 


Ala 


Ser 


Val 


Gin 








20 










Thr 


Ala 


Arg 


Thr 


Val 


Ser 


Glu 


Val 






35 










40 


Asn 


Lys 


Ser 


Ser 


Tyr 


Thr 


Val 


Lys 
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Lys 


Trp 


Leu 


Val 


Lys 


Ala 


Ser 




10 










15 




Val 


Leu 


Ser 


Ala 


Gly 


Ser 


Arg 


Val 


25 










30 






He 


Asp 


Asn 


Gly Arg 


He 


Thr 


Thr 










45 








Cys 


Gly Val 


Asp 


Tyr 


Ala 


Val 


Pro 








60 










Ala 


Asp 


Gly 


Thr 


Val 


Lys 


Phe 


Ala 






75 










80 


Met 


Thr 


Asp 


Leu 


Ala 


Gly 


Asn 


Cys 




90 










95 




Met 


His 


Ser 


Gly 


Tyr 


Ala 


His 


Met 


105 










110 






Glu 


Lys 


Val 


Lys 


Gin Gly Asp 


He 










125 








Met 


Ala 


Thr 


Gly 


Pro 


His 


Leu 


His 








140 










Asn 


Phe 


Gin 


Asn 


Gly 


Phe 


His 


Gly 






155 










160 


Ala 


Asn 


Val 


Ala 


Thr 


Phe 


Ser 


Gly 




170 










175 




He 


Lys 


Pro 


Leu 


Gin 


Ser 


Ala 


Pro 


185 










190 






Lys 


Val 


Tyr 


Arg 


Val 


Asp 


Glu 


Leu 










205 








Val 


Lys 


Asn 


Asn 


Thr 


Leu 


Thr 


Pro 








220 










Gly 


He 


Pro 


Ala 


Ser 


Glu 


He 


Asp 






235 










240 


Thr 


Ala 


Asp 


Gin 


Val 


Leu 


Gin 


Lys 




250 










255 




Lys 


Thr 


Leu 


Lys 


Thr 


Val 


Glu 


Lys 


265 










270 






Thr 


Trp 


Ala 


Lys 


Thr 


Arg 


Phe 


Ala 










285 








Val 


Asp 


Asn 


Ser 


Gin 


Glu 


Leu 


Leu 








300 










Leu 


Thr 


Ser 


Thr 


Met 


Ala 


Ala 


Ser 




10 










15 




Ala 


Gin 


Glu 


Thr 


Asp 


Thr 


Thr 


Trp 


25 










30 






Lys 


Ala 


Asp 


Leu 


Val 


Lys 


Gin 


Asp 










45 








Tyr 


Gly Asp 


Thr 


Leu 


Ser 


Val 


He 
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50 










55 










60 










Ser 


Glu 


Ala 


Met 


Ser 


He 


Asp 


Met 


Asn 


Val 


Leu 


Ala 


Lys 


He 


Asn 


Asn 


65 










70 










75 










80 


He 


Ala 


Asp 


He 


Asn 


Leu 


He 


Tyr 


Pro 


Glu 


Thr 


Thr 


Leu 


Thr 


Val 


Thr 










85 










90 










95 




Tyr 


Asp 


Gin 


Lys 


Ser 


His 


Thr 


Ala 


Thr 


Ser 


Met 


Lys 


He 


Glu 


Thr 


Pro 








100 










105 










110 






Ala 


Thr 


Asn 


Ala 


Ala 


Gly 


Gin 


Thr 


Thr 


Ala 


Thr 


Val 


Asp 


Leu 


Lys 


Thr 
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120 










125 








Asn 


Gin 


Val 


Ser 


Val 


Ala 


Asp 


Gin 


Lys 


Val 


Ser 


Leu 


Asn 


Thr 


He 


Ser 
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140 
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Met 


Thr 


Pro 


Glu 


Ala 


Ala 


Thr 


Thr 


He 


Val 


Ser 


Pro 


Met 


Lys 


145 










150 










155 
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Ser 


Ala 
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Ala 
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Lys 


Ser 


Lys 


Glu 


Val 


Leu 


Ala 


Gin 
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Glu 


Gin 


Ala 


Val 


Ser 


Gin 


Ala 


Ala 


Ala 


Asn 


Glu 


Gin 


Val 


Ser 


Thr 


Ala 
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Thr 


Ser 


Glu 


Val 
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Ala 


Ala 


Lys 


Glu 


Glu 


Val 
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Lys 
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Gin 


Thr 


Ser 


Val 


Ser 


Gin 


Ser 


Thr 


Thr 


Val 


Ser 


Pro 


Ala 




210 










215 










220 










Ser 


Val 


Ala 


Ala 


Glu 


Thr 


Pro 


Ala 


Pro 


Val 


Ala 


Lys 


Val 


Ala 


Pro 


Val 


225 










230 










235 










240 


Arg 


Thr 


Val 


Ala 


Ala 


Pro 


Arg 


Val 


Ala 


Ser 


Val 


Lys 


Val 


Val 


Thr 


Pro 










245 










250 










255 




Lys 


Val 


Glu 


Thr 


Gly Ala 


Ser 


Pro 


Glu 


His 


Val 


Ser 


Ala 


Pro 


Ala 


Val 








260 










265 










270 






Pro 


Val 


Thr 


Thr 


Thr 


Ser 


Thr 


Ala 


Thr 


Asp 


Ser 


Lys 


Leu 


Gin 


Ala 


Thr 






275 










280 










285 








Glu 


Val 


Lys 


Ser 


Val 


Pro 


Val 


Ala 


Gin 


Lys 


Ala 


Pro 


Thr 


Ala 


Thr 


Pro 




290 










295 










300 










Val 


Ala 


Gin 


Pro 


Ala 


Ser 


Thr 


Thr 


Asn 


Ala 


Val 


Ala 


Ala 


His 


Pro 


Glu 
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Ala 


Gly 
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Gin 


Pro 


His 


Val 


Ala 


Ala 


Tyr 


Lys 


Glu 


Lys 


Val 


Ala 
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335 




Ser 


Thr 


Tyr 


Gly 


Val 


Asn 


Glu 


Phe 


Ser 


Thr 


Tyr 


Arg 


Ala 


Gly 


Asp 


Pro 








340 










345 
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Gly 


Asp 


His 


Gly 


Lys 


Gly 


Leu 


Ala 


Val 


Asp 


Phe 


He 


Val 


Gly 


Lys 


Asn 
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360 
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Gin 


Ala 


Leu 


Gly 


Asn 


Glu 


Val 


Ala 


Gin 
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Ser 


Thr 


Gin 


Asn 


Met 


Ala 
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380 










Ala 


Asn 


Asn 


He 


Ser 


Tyr 


Val 


He 


Trp 


Gin 


Gin 


Lys 


Phe 


Tyr 


Ser 


Asn 
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390 
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400 


Thr 


Asn 


Ser 


He 


Tyr 


Gly 


Pro 


Ala 


Asn 


Thr 


Trp 


Asn 


Ala 


Met 


Pro 


Asp 










405 










410 










415 




Arg 


Gly 


Gly Val 


Thr 


Ala 


Asn 


His 


Tyr 


Asp 


His 


Val 


His 


Val 


Ser 


Phe 



420 425 430 



Asn Lys 



<210> 40 

<211> 232 

<212> PRT 

<213> Streptococcus 



<400> 40 
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Met 


Pro 


His 


Leu 


Ser 


Lys 


Glu 


Ala 


Phe 


Lys 


Lys 


Gin 


He 


Lys 


Asn 


Gly 


1 
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15 
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Val 


Ser 
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Gin 


Ala 


Leu 


Pro 
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Pro 


Leu 
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Thr 


Glu 








20
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Val 
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Ala 


Gly 






35
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Arg 
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Val 
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He 
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Glu 


He 


Gin 




50 










55 










60 










Glu 


Val 


Thr 


Asn 


Leu 


Pro 


He 


He 


Gly 


lie 


He 


Lys 


Arg 


Glu 


Tyr 


Pro 
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Pro 
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Val 
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Ala 


Leu 


Asp 
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Leu 


Arg 


Glu 
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Arg 
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Asp 


Gly 
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Val 


Ala 


Glu 


Phe 


He 


Gin 


Lys 


He 
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Gly 
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120 
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Glu 


Gin 


Leu 


Leu 


Met 


Ala 
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He 


Ser 


Thr 


Phe 


Glu 


Glu 




130 










135 










140 










Gly 


Lys 


Asn 


Ala 


Phe 


Glu 


Ala 


Gly 


Val 


Asp 


Phe 


Val 


Gly 


Thr 


Thr 


Leu 


145 










150 










155 
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Ser 


Gly 


Tyr 


Thr 


Asp 


Tyr 


Xaa 


Arg 


Gin 


Glu 


Glu 


Gly 


Pro 


Asp 


He 


Glu 










165 










170 










175 




Leu 


Leu 


Asn 


Lys 


Leu 


Cys 


Gin 


Ala 


Gly 


He 


Asp 


Val 


He 


Ala 


Glu 


Gly 
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185 










190 
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His 
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Gin 


Ala 


Asn 


Glu 


He 


Asn 


His 


He 


Gly 


Val 
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205 








Ala 


Gly 


He 


Val 


Val 


Gly 


Gly 


Ala 


He 


Thr 


Arg 
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He 


Ala 




210 










215 










220 










Glu 


Arg 


Phe 


He 


Ser 


Gly 


Leu 
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225 230 



<210> 41
<211> 39
<212> PRT
<213> Streptococcus

<400> 41
Met Ser Ile Lys Lys Ser Val Ile Gly Phe Cys Leu Gly Ala Ala Ala
1 5 10 15

Leu Ser Met Phe Ala Cys Val Asp Ser Ser Gln Ser Val Met Ala Ala
20 25 30

Glu Lys Asp Lys Val Glu Ile
35



<210> 42
<211> 1305
<212> DNA
<213> Streptococcus

<400> 42

atgaaaatga ataaaaaggt actattgaca tcgacaatgg cagcttcgct attatcagtc 60

gcaagtgttc aagcacaaga aacagatacg acgtggacag cacgtactgt ttcagaggta 120

aaggctgatt tggtaaagca agacaataaa tcatcatata ctgtgaaata tggtgataca 180

ctaagcgtta tttcagaagc aatgtcaatt gatatgaatg tcttagcaaa aattaataac 240

attgcagata tcaatcttat ttatcctgag acaacactga cagtaactta cgatcagaag 300

agtcatactg ccacttcaat gaaaatagaa acaccagcaa caaatgctgc tggtcaaaca 360

acagctactg tggatttgaa aaccaatcaa gtttctgttg cagaccaaaa agtttctctc 420

aatacaattt cggaaggtat gacaccagaa gcagcaacaa cgattgtttc gccaatgaag 480

acatattctt ctgcgccagc tttgaaatca aaagaagtat tagcacaaga gcaagctgtt 540

agtcaagcag cagctaatga acaggtatca acagctcctg tgaagtcgat tacttcagaa 600

gttccagcag ctaaagagga agttaaacca actcagacgt cagtcagtca gtcaacaaca 660

gtatcaccag cttctgttgc cgctgaaaca ccagctccag tagctaaagt agcaccggta 720

agaactgtag cagcccctag agtggcaagt gttaaagtag tcactcctaa agtagaaact 780

ggtgcatcac cagagcatgt atcagctcca gcagttcctg tgactacgac ttcaacagct 840

acagacagta agttacaagc gactgaagtt aagagcgttc cggtagcaca aaaagctcca 900

acagcaacac cggtagcaca accagcttca acaacaaatg cagtagctgc acatcctgaa 960

aatgcagggc tccaacctca tgttgcagct tataaagaaa aagtagcgtc aacttatgga 1020

gttaatgaat tcagtacata ccgtgcaggt gatccaggtg atcatggtaa aggtttagca 1080

gtcgacttta ttgtaggtaa aaaccaagca cttggtaatg aagttgcaca gtactctaca 1140

caaaatatgg cagcaaataa catttcatat gttatctggc aacaaaagtt ttactcaaat 1200

acaaatagta tttatggacc tgctaatact tggaatgcaa tgccagatcg tggtggcgtt 1260

actgccaacc attatgacca tgttcacgta tcatttaaca aataa 1305



<210> 43 
<211> 1230 
<212> DNA 

<213> Streptococcus 
<400> 43 

caagaaacag atacgacgtg gacagcacgt actgtttcag aggtaaaggc tgatttggta 60 

aagcaagaca ataaatcatc atatactgtg aaatatggtg atacactaag cgttatttca 120 

gaagcaatgt caattgatat gaatgtctta gcaaaaatta ataacattgc agatatcaat 180 

cttatttatc ctgagacaac actgacagta acttacgatc agaagagtca tactgccact 240 

tcaatgaaaa tagaaacacc agcaacaaat gctgctggtc aaacaacagc tactgtggat 300 

ttgaaaacca atcaagtttc tgttgcagac caaaaagttt ctctcaatac aatttcggaa 360 

ggtatgacac cagaagcagc aacaacgatt gtttcgccaa tgaagacata ttcttctgcg 420 

ccagctttga aatcaaaaga agtattagca caagagcaag ctgttagtca agcagcagct 480 

aatgaacagg tatcaacagc tcctgtgaag tcgattactt cagaagttcc agcagctaaa 540 

gaggaagtta aaccaactca gacgtcagtc agtcagtcaa caacagtatc accagcttct 600 

gttgccgctg aaacaccagc tccagtagct aaagtagcac cggtaagaac tgtagcagcc 660 

cctagagtgg caagtgttaa agtagtcact cctaaagtag aaactggtgc atcaccagag 720 

catgtatcag ctccagcagt tcctgtgact acgacttcaa cagctacaga cagtaagtta 780 

caagcgactg aagttaagag cgttccggta gcacaaaaag ctccaacagc aacaccggta 840 

gcacaaccag cttcaacaac aaatgcagta gctgcacatc ctgaaaatgc agggctccaa 900 

cctcatgttg cagcttataa agaaaaagta gcgtcaactt atggagttaa tgaattcagt 960 

acataccgtg caggtgatcc aggtgatcat ggtaaaggtt tagcagtcga ctttattgta 1020 

ggtaaaaacc aagcacttgg taatgaagtt gcacagtact ctacacaaaa tatggcagca 1080 

aataacattt catatgttat ctggcaacaa aagttttact caaatacaaa tagtatttat 1140 

ggacctgcta atacttggaa tgcaatgcca gatcgtggtg gcgttactgc caaccattat 1200 

gaccatgttc acgtatcatt taacaaataa 1230 

<210> 44 
<211> 409 
<212> PRT 

<213> Streptococcus 



<400> 44 

Gln Glu Thr Asp Thr Thr Trp Thr Ala Arg Thr Val Ser Glu Val Lys

1 5 10 15

Ala Asp Leu Val Lys Gln Asp Asn Lys Ser Ser Tyr Thr Val Lys Tyr

20 25 30

Gly Asp Thr Leu Ser Val Ile Ser Glu Ala Met Ser Ile Asp Met Asn

35 40 45

Val Leu Ala Lys Ile Asn Asn Ile Ala Asp Ile Asn Leu Ile Tyr Pro

50 55 60

Glu Thr Thr Leu Thr Val Thr Tyr Asp Gln Lys Ser His Thr Ala Thr

65 70 75 80

Ser Met Lys Ile Glu Thr Pro Ala Thr Asn Ala Ala Gly Gln Thr Thr

85 90 95

Ala Thr Val Asp Leu Lys Thr Asn Gln Val Ser Val Ala Asp Gln Lys

100 105 110

Val Ser Leu Asn Thr Ile Ser Glu Gly Met Thr Pro Glu Ala Ala Thr

115 120 125

Thr Ile Val Ser Pro Met Lys Thr Tyr Ser Ser Ala Pro Ala Leu Lys

130 135 140

Ser Lys Glu Val Leu Ala Gln Glu Gln Ala Val Ser Gln Ala Ala Ala

145 150 155 160

Asn Glu Gln Val Ser Thr Ala Pro Val Lys Ser Ile Thr Ser Glu Val

165 170 175

Pro Ala Ala Lys Glu Glu Val Lys Pro Thr Gln Thr Ser Val Ser Gln

180 185 190

Ser Thr Thr Val Ser Pro Ala Ser Val Ala Ala Glu Thr Pro Ala Pro

195 200 205

Val Ala Lys Val Ala Pro Val Arg Thr Val Ala Ala Pro Arg Val Ala

210 215 220

Ser Val Lys Val Val Thr Pro Lys Val Glu Thr Gly Ala Ser Pro Glu

225 230 235 240

His Val Ser Ala Pro Ala Val Pro Val Thr Thr Thr Ser Thr Ala Thr

245 250 255

Asp Ser Lys Leu Gln Ala Thr Glu Val Lys Ser Val Pro Val Ala Gln

260 265 270

Lys Ala Pro Thr Ala Thr Pro Val Ala Gln Pro Ala Ser Thr Thr Asn

275 280 285

Ala Val Ala Ala His Pro Glu Asn Ala Gly Leu Gln Pro His Val Ala

290 295 300

Ala Tyr Lys Glu Lys Val Ala Ser Thr Tyr Gly Val Asn Glu Phe Ser

305 310 315 320

Thr Tyr Arg Ala Gly Asp Pro Gly Asp His Gly Lys Gly Leu Ala Val

325 330 335

Asp Phe Ile Val Gly Lys Asn Gln Ala Leu Gly Asn Glu Val Ala Gln

340 345 350

Tyr Ser Thr Gln Asn Met Ala Ala Asn Asn Ile Ser Tyr Val Ile Trp

355 360 365

Gln Gln Lys Phe Tyr Ser Asn Thr Asn Ser Ile Tyr Gly Pro Ala Asn

370 375 380

Thr Trp Asn Ala Met Pro Asp Arg Gly Gly Val Thr Ala Asn His Tyr

385 390 395 400

Asp His Val His Val Ser Phe Asn Lys

405